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The present invention relates to gene therapy, especially to adenovirus-based gene therapy, and related cell lines and compositions. 
In particular, novel packaging cell lines are disclosed, for use in facilitating the development of high-capacity vectors. The invention also 
discloses a variety of high-capacity adenovirus vectors and related compositions and kits including the disclosed cell lines and vectors. 
Finally, the invention discloses methods of preparing and using the disclosed vectors, cell lines and kits. 



Codes used to identify 

AL Albania 

AM Armenia 

AT Austria 

AU Australia 

AZ> Azerbaijan 

BA Bosnia and Herzegovina 

BR Barbados 

BE Belgium 

BF Burkina Faso 

BG Bulgaria 

BJ Benin 

BR Brazil 

BY Belarus 

CA Canada 

CF Central African Republic 

CG Congo 

CH Switzerland 

Cl Cote d'lvoire 

CM Cameroon 

CN China 

CU Cuba 

rjZ Czech Republic 

DE Germany 

DK Denmark 

EE Estonia 



FOR THE PURPOSES 
States party to the PCT on the front 



OF INFORMATION ONLY 

, of pamphlets publishing international applications under the PCT. 



ES 


Spain 


Ft 


Finland 


FR 


France 


GA 


Gabon 


GB 


United Kingdom 


GE 


Georgia 


CH 


Ghana 


GN 


Guinea 


GR 


Greece 


HU 


Hungary 


IE 


Ireland 


II- 


Israel 


IS 


Iceland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 



KG 
KP 

KR 
KZ 
LC 
U 
LK 
LR 



Kyrgyzstan 
Democratic People's 
Republic of Korea 
Republic of Korea 
Kazakstan 
Saint iJicia 
Liechtenstein 
Sri Lanka 
Liberia 



pages ( 

LS 

LT 

LU 

LV 

MC 

MD 

MG 

MK 



Lesotho 

Lithuania 

Luxembourg 

Latvia 

Monaco 

Republic of Moldova 

Madagascar 

The former Yugoslav 

Republic of Macedonia 



ML 


Mali 


MN 


Mongolia 


MR 


Mauritania 


MW 


Malawi 


MX 


Mexico 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SG 


Singapore 



SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


sz 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TM 


Turkmenistan 


TR 


Turkey 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


uz 


Uzbekistan 


VN 


Viet Nam 


YU 


Yugoslavia 


zw 


Zimbabwe 



WO 98/13499 



PCT/EP97/05251 



-1 - 

PACKAGING CELL LINES FOR USE IN FACILITATING THE DEVELOPMENT OF HIGH-CAPACITY 

ADENOVIRAL VECTORS 

This invention was made with U.S government support under NIH Grant No, HL 54352. The 
government has certain rights in the invention. 

The present invention relates to gene therapy, especially to adenovirus-based gene 
therapy. In particular, novel packaging cell lines are disclosed, for use in facilitating the 
development of high-capacity vectors. High-capacity adenovirus vectors are also disclosed 
herein, as are related compositions, kits, and methods of preparation and use of the disclosed 

vectors, cell lines and kits. 

Enhanced transfer of DNA conjugates into cells has been achieved with adenovirus, a 
human DNA vims which readily infects epithelial cells (Horwitz, "Adenoviridae and their 
replication", in Virology . Fields and Knipe, eds., Raven Press, NY (1990) pp. 1679-1740). 

Although adenovirus-mediated gene therapy represents an improved method of DNA 
transfer into cells, a potential limitation of this approach is that adenovirus replication results in 
disruption of the host cell. In addition, adenovirus also possesses oncogenic properties 
including the ability of one of its proteins to bind to tumor suppressor gene products. The use 
of so-called replication defective strains of adenovirus (which typically possess El A and/or 
E1B deletions that render the virus unable to replicate in host cells) is in principle more suitable 
for in vivo therapy; however, the potential of co-infection of epithelial cells with wild-type 
strains of virus resulting in transactivation of the recombinant virus may represent a significant 
safety concern for in vivo applications. Furthermore, it is not yet known which recombinant 
adenoviruses are capable of integrating their genome into host cell DNA allowing for 
long-term stable expression of any foreign genes they may be transporting. 

Another undesirable aspect of using intact or replication-competent adenovirus as a 
gene transfer means is that it is an oncogenic virus whose gene products are known to interfere 
with the function of host cell tumor suppressor proteins as well as immune recognition 
molecules, such as the major histocompatibility complex (MHC). In addition, pre-existing 
circulating antibodies to adenovirus may significantly reduce the efficiency of in vivo gene 
delivery. Lastly, only a foreign gene of 6 kilobases (kb) or less can be incorporated into the 
intact adenovirus genome for gene transfer experiments, whereas DNA segments of greater 
than 15 kb can be transferred using the methods of this invention. 
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invention may also be engineered to target the receptors of and achieve penetration of non- 
epithelial cells; means of engineering viral vectors to accomplish these ends are described in 
detail hereinbelow. 

Second, the vectors of the present invention have deletions of substantial portions of 
the Ad genome, which not only limits the ability of the Ad-derived vectors to "spread" to other 
host cells or tissues, but allows significant amounts of "foreign" (or non-native) nucleic acids 
to be incorporated into the viral genome without interfering with the reproduction and 
packaging of the viral genome. Therefore, the vectors of the present invention are ideal for use 
in a wide variety of therapeutic applications. 

Third, while the vectors disclosed herein are safe for use as therapeutic agents in the 
treatment of a variety of human afflictions, they do not require the presence of any "helpers" 
for propagation and packaging, largely because of the novel cell lines in which they are 
reproduced. Such cell lines - referred to herein as packaging cell lines - comprise yet another 

aspe.ct of the invention. 

To reduce the frequency of contamination with wild-type adenovirus, it is desirable to 
improve either the viral vector or the cell line to reduce the probability of recombination. For 
example, an adenovirus from a group with less homology to the group C viruses may be used 
to engineer recombinant viruses with little propensity for recombination with the Ad5 sequence 
in 293 cells. Similarly, an epithelial cell line - 293 or another - may be prepared according to 
within-disclosed methods which stably expresses adenovirus proteins or polypeptides from 
Ad3 and/or proteins or polypeptides from another non-group-C or group C serotype; such a 
cell line would is useful for supporting adenovirus-derived viral vectors bearing deletions of 
regulatory and/or structural genes, irrespective of the serotype from which such a vector was 
derived. 

It is also contemplated that the constructs and methods of the present invention will 
support the design and engineering of chimeric viral vectors which express amino acid residue 
sequences derived from two or more Ad serotypes. Thus, unlike methods and constructs 
available prior to the advent of the present disclosure, this invention allows the greatest 
possible flexibility in the design and preparation of useful viral vectors and cell lines which 
support their construction and propagation - all with a decreased risk of recombining with 
wild-type Ad to produce potentially-harmful recombinants. 
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g. polypeptide VII; 

h. polypeptide VHI; and 

i. biologically active fragments thereof is disclosed. 

In one variation, the sequences are constitutively expressed; in another, one or more 
sequences is under the control of a regulable promoter. In a preferred embodiment 
expression is constitutive. In various preferred embodiments, the polypeptides expressed by the 
DNA sequences are biologically active. 

In a further and preferred embodiment the packaging cell line of the present invention 
supports the production of a viral vector. In a preferred embodiment the viral vector is a 
therapeutic vector. 

In one aspect of the present invention, each DNA sequence is introduced into the 
genome of the within-disclosed cell lines via a separate complementing plasmid. In other 
embodiments, two or more DNA sequences were introduced into the genome via a single 
complementing plasmid. In one variation, the complementing plasmid comprises a DNA 
sequence encoding adenovirus fiber protein, polypeptide or fragment thereof. An example of a 
useful complementing plasmid according to the present invention is a plasmid having the 
characteristics of pCLF (for deposit details, see Example 3) . 

In another aspect of the present invention, the complementing plasmid used to 
transform a cell line of the present invention further comprises a DNA sequence encoding an 
adenovirus regulatory protein, polypeptide or fragment thereof. In one variation, the 
regulatory protein is selected from the group consisting of El A, E1B, E2A, E2B, E3, E4 and 
L4 (also referred to as "the 100K protein"); an exemplary complementing plasmid has the 
characteristics of is pE4/Hygro?? (for deposit details, see Example 3). In another aspect, the 
complementing plasmid used to transform a cell line of the present invention further comprises 
a DNA sequence encoding two or more of the above mentioned adenovirus regulatory 
proteins, polypeptides or fragments thereof. 

In one variation, the two or more regulatory proteins, polypeptides or fragments 
thereof are selected from the group consisting of El A, E1B, E2A, E2B, E3, E4 and L4 (also 
referred to as "the 100K protein"). In another variation, the structural protein is selected from 
the group consisting of penton base; hexon; fiber; polypeptide ma; polypeptide V; polypeptide 
VI; polypeptide VII; polypeptide VIE; and biologically active fragments thereof. 
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regulatory proteins (or portions thereof) that have been deleted. Thus, in one embodiment, the 
foreign DNA encodes a tumor-suppressor protein; a suicide protein; a cystic fibrosis 
transmembrane conductance regulator (CFTR) protein; or a biologically active fragment of any 
of them. 

Any of the within-disclosed cell lines may have a DNA sequence encoding all or part of 
a fiber protein - including modified or chimeric proteins ~ stably integrated into the genome, 
thus, in one variation, the fiber protein has been modified to include a non-native amino acid 
residue sequence which targets a specific receptor, but which does not disrupt trimer formation 
or transport of fiber into the nucleus. In one variation, the non-native amino acid residue 
sequence is coupled to the carboxyl terminus of the fiber. In yet another, the non-native amino 
acid residue sequence further includes a linker sequence. Alternatively, the fiber protein 
further comprises a ligand coupled to the linker. A suitable ligand may be selected from the 
group consisting of ligands that specifically bind to a cell surface receptor and ligands that can 
be used to couple other proteins or nucleic acid molecules. In one variation, the ligand is 
selected from the group consisting of ligands that specifically bind to a cell surface receptor 
and ligands that can be used to couple other proteins or nucleic acid molecules. 

In yet another embodiment, the non-native amino acid residue sequence is incorporated 
into the fiber amino acid residue sequence at a location other than one of the fiber termini. 
Alternatively, the non-native amino acid residue sequence alters the binding specificity of the 
fiber for a targeted cell type. In other embodiments, the linker sequence alters the binding 
specificity of the fiber for a targeted cell type. The expressed fiber may, in various 
embodiments, bind to a specific targeted cell type not usually targeted by adenovirus and/or 
may comprise amino acid residue sequences from more than one adenovirus serotype. 

In various aspects of the present invention, a packaging cell line of the present 
invention is derived from a procaryotic cell line; in another, it is derived from a eucaryotic cell 
line. While various embodiments suggest the use of mammalian cells, and more particularly, 
epithelial cell lines, a variety of other, non-epithelial cell lines are used in various embodiments. 
Thus, while various embodiments disclose the use of a cell line selected from the group 
consisting of 293, A549, W162, HeLa, Vero, 211, and 21 1 A cell lines, it is understood that 
various other cell lines are likewise contemplated for use as disclosed herein. 
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The invention further discloses complementing plasmids and methods of making same. 
In one embodiment, a complementing plasmid comprises a promoter nucleotide sequence 
operatively linked to a nucleotide sequence encoding an adenovirus structural polypeptide. In 
one variation, the complementing plasmid comprises pCLF. In another variation, a 
complementing plasmid further comprises a nucleotide sequence encoding a first adenovirus 
regulatory polypeptide, a nucleotide sequence encoding a second regulatory polypeptide, a 
nucleotide sequence encoding a third regulatory polypeptide; or any combination of the 
foregoing. In still another embodiment, the adenovirus structural polypeptide is selected from 
the group consisting of penton base; hexon; fiber; polypeptide ma; polypeptide V; polypeptide 
VI; polypeptide VII; polypeptide VIII; and biologically active fragments thereof. 

The present invention also discloses a complementing plasmid comprising a promoter 
nucleotide sequence operatively linked to a nucleotide sequence encoding an adenovirus 
structural protein, polypeptide or fragment thereof and a nucleotide sequence encoding an 
adenovirus regulatory protein, polypeptide or fragment thereof. In one variation, the early 
region polypeptide is E4; in another, the plasmid comprises pE4/Hygro. In still another 
variation, the early region polypepudes are El and E4. Complementing plasmids further 
comprising a nucleotide sequence encoding an adenovirus structural protein, polypeptide or 
fragment thereof are also contemplated, as are plasmids wherein the promoter nucleotide 
sequence is selected from the group consisting of MMTV, CMV and E4 promoter nucleotide 
sequences. 

Viral vectors are also disclosed which comprise nucleotide sequences encoding a 
packaging signal and a foreign protein or polypeptide, wherein the nucleotide sequence 
encoding an adenovirus structural protein has been deleted. In one variation, the nucleotide 
sequence encoding the foreign protein or polypeptide is a DNA molecule up to about 3 kb in 
length; in another, the nucleotide sequence encoding the foreign protein or polypeptide is a 
DNA molecule up to about 9.5 kb in length; in still another, the nucleotide sequence encoding 
the foreign protein or polypeptide is a DNA molecule up to about 12.5 kb in length. 
Nucleotide sequences of intermediate lengths are also contemplated by the present invention, 
as are sequences in excess of 12.5 kb. 

The invention also discloses viral vectors wherein the sequence encoding a foreign 
protein or polypeptide is a sequence encoding an anti-tumor agent, a tumor suppressor protein 
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further comprises a nucleotide sequence encoding a foreign polypeptide. In various 
embodiments, the polypeptide is a therapeutic molecule. 

Another embodiment discloses a composition as before, wherein the first delivery 
plasmid lacks adenovirus packaging signal sequences. In another aspect, the second delivery 
plasmid contains a LacZ reporter construct. Another variation discloses that the second 
delivery plasmid further lacks a nucleotide sequence encoding an adenovirus regulatory 
protein. In one variation, the regulatory protein is El. In one embodiment of the above-noted 
compositions, the complementing plasmid has the characteristics of pCLF. 

In another embodiment, a composition is disclosed wherein the first delivery plasmid 
lacks a nucleotide sequence encoding an adenovirus structural protein and the second delivery 
plasmid lacks a nucleotide sequence encoding adenovirus El protein. In another, the first 
delivery plasmid lacks a nucleotide sequence encoding adenovirus E4 protein and the second 
delivery plasmid lacks a nucleotide sequence encoding adenovirus El protein. In still another, 
the cell contains at least one complementing plasmid encoding an adenoviral regulatory protein 
and a structural protein. 

In alternative embodiments, the regulatory protein is E4 and the structural protein is 
fiber; or the regulatory protein is El and the structural protein is fiber. In still another 
embodiment, the adenoviral regulatory protein and the structural protein are encoded by 
separate complementing plasmids. 

Another variation discloses a composition wherein the cell is selected from the group 
consisting of 293, A549, W162, HeLa, Vero, 211, and 21 1 A. In another embodiment, the 
delivery plasmid is DV1, or p E1B gal, or p ElsplB. 

Various methods of making and using the vectors, plasmids, cell lines and other 
compositions and constructs of the present invention are also disclosed herein. The following 
methods are considered exemplary and not limiting. 

Thus, in one variation, the invention discloses a method of constructing therapeutic 
viral vectors, comprising introducing a delivery plasmid into an Ad fiber-expressing 
complementing cell line, wherein the DNA sequence encoding Ad fiber protein has been 
deleted from the delivery plasmid. In one variation, the delivery plasmid further includes a 
DNA sequence encoding a foreign protein, polypeptide, or fragment thereof. In other 
embodiments, the delivery plasmid is DV1, p E1B gal, p ElsplB, or similar constructs. 
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Ir-suppressor gene. ,n stil, another e„*odimen, dte agen, is a suicide gene or a fun 
e^enTrhereo, * anodner vaHatton. the vector is admtnisreted via inrr— ^eorton. 

The tnvemion also discloses systems 0, kits for use in any of the aforementtoned 
oredtod, The systems or kits may contain any appropriare combination of ft. wtthm- 
described vectors, plaamids. cell lines, and additional therapeutic agents as dtsdoseo. 
Preferably, each such tt or system includes a ,uan,iry of the appropriare thempeurtc substance 
or seq uence sufftcien, for a, lea,, one administration, and —ions for admintarmtton and 
use Tnus, one system furrher comprises an effeorive amonn, of . rherapeuric agent „h,h 
enhances the therapeu.ic effect of the therapeuric viral vecror-containtng oompostrton. 
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Another variation discloses that the composition and the therapeutic agent are each included in 
a separate receptacle or container. 

It will also be appreciated that any combination of the preceding elements may also be 
efficacious as described herein, and that all related methods are also within the scope of the 
present invention. 

K KTF.F DESCRIPT ION O F THE DRAWINGS 

Figure 1 is a schematic diagram of the entire adenoviral E4 transcriptional unit with the 
open reading frames (ORF) indicated by blocked segments along with the promoter and 
terminator sequences. The location of primers for amplifying specific portions of E4 are also 
indicated as further described in Example 1 A. 

Figure 2 is a schematic map of plasmid pE4/Hygro as further described in Example IB. 

Figure 3 is a schematic map of plasmid pCDNA3/Fiber as further described in Example 

IB. 

Figure 4 is a schematic map of plasmid pCLF as further described in Example IB. 

Figure 5 is a photograph of a Southern blot showing the presence of intact adenovirus 
E4 3.1 kilobase (kb) insert in the 21 1 cell line as further described in Example 1C. 

Figure 6 is a photograph of a Western blot showing labeled fiber protein detected under 
native and denaturing electrophoresis conditions as described in Example 1C. The 293 cells 
lack fiber while the sublines 21 1 A, 21 IB and 21 1R contain fiber protein detectable in 
functional trimerized form and denatured monomeric form. 

Figure 7 is a schematic map of plasmid pDEX/El as further described in Example ID. 

Figure 8 is a schematic map of plasmid pEl/Fiber as further described in Example 1F1. 

Figure 9 is a schematic map of plasmid pE4/Fiber as further described in Example 1F2). 

Figure 10 is a schematic illustration of linearized gal delivery 

plasmids for use in cotransfection and recombination to form a recombinant adenoviral vector 
having multiple adenoviral gene deletions. The plasmids and recombination event are more 
fully described in Example 2A. 

Figure 1 1 is a schematic of plasmid pi 1.3 as further described in Example 2A used in 
the construction of pDV44 delivery plasmid with plasmid p8.2. 
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Figure 12 is a schematic of plasmid p8.2 as further described in Example 2A used in the 
construction of pDV44 delivery plasmid with plasmid pi 13. 

Figure 13. Trin,ericstruct ure oftberecombir,antfiben293.211A,211B.or211Reellsas 

bdicated were metabolically labeled with ["^methionine, soluble protein extracts prepared, and fiber 
was h.munoprecipitated. A portion of the precipitated protein was dectrophoresed on an 8% SDS- 
PAGE gel under either semi-native or denaturing conditions. The positions of trimetic (T) and 
monomeric (M) fiber are indicated. As a control for electrophoreuc conditions, recombinant Ad2 fiber 
produced in baculovirus-infected cells was run under identical conditions and stained with Coomassie 
blue. 

Fig. 14. ComplcmentationofaEbermutantadenovirusbyflber-producingcells: Tbecelllines 
indicated (2xl0 6 cells per sample) were infected with the temperature- sensitive fiber mutant adenovirus 
H5ttl42 at 10 PFU/cell and incubated at either the permissive (32.5 « C. stippled bars) or the restrictive 
(39 J • C, solid ban) temperature. 48 hours post-infection, virus was isolated by freeze-thaw lysis and 
yields determined by fluorescent focus assay on SW480 cells. Each value represents the mean of 
duplicate samples, and the data shown is representative of multiple experiments. 
Fig. 15. fccorporationoftberecombmantAdS fiber iotoAdB particles: A) Alignment of the N- 
terrninal (penton base-binding) domains of fiber proteins from several different adenovirus serotypes. 
B) Type 3 adenovirus was propagated in 293, 211B, or 211R cells as indicated and purified by two 
sequential CsCl centrifugations. 10 mg of the purified viral particles was then electrophoresed uuder 
denaturing conditions and transferred to aPVDF membrane. Adi fiber was detected with a polyclonal 
rabbit antibody raised against recombinant Ad2 fiber. As a positive control for detection, 400 ng of 
wild-type Ad2 was run in the lane marked 'Ad2\ Under these conditions, the mobilities of the Ad2 and 
Ad5 fibers are indistinguishable and the antibody reacts with both proteins. 

Fig. 16. Nuclear localization of the recombinant fiber protein in three packaging cell lines: Cells 
were grown on 8-well chamber slides, stained with a rabbit anti-fiber polyclonal antibody and visualized 
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with a FITC-conjugated goat anti-rabbit antibody. 
293 cells (negative control). E) 293 cells infected 
post-infection (positive control). F) Infected cells 
antibody. 



A) line211A. B) Line211B C) Line211R D) 
with Ad.RSVbgal at 1 pfu/cell and stained 24 hour 
prepared as in .(E) but stained without the primary 



DFTATLED DESCRIPTION To reduce the frequency of contamination with wild-type 
adenovirus, it is considered desirable to improve either the viral vector or the cell line to reduce 
the probability of recombination. For example, an adenovirus from a group with less 
homology to the group C viruses may be used to engineer recombinant viruses with little 
propensity for recombination with the Ad5 sequence in 293 cells. Similarly, an epithelial cell 
line -- e.g. the cell line known as 293 -- may be used or further modified according to within- 
disclosed methods which stably expresses adenovirus proteins or polypeptides from Ad3 
and/or proteins or polypeptides from another non-group-C or group C serotype; such a cell 
line would be useful to support adenovirus-derived viral vectors bearing deletions of regulatory 
and/or structural genes, irrespective of the serotype from which such a vector was derived. 

It is also contemplated that the constructs and methods of the present invention will 
support the design and engineering of chimeric viral vectors which express amino acid residue 
sequences derived from two or more Ad serotypes. Thus, unlike methods and constructs 
available prior to the advent of the present disclosure, this invention allows the greatest 
possible flexibility in the design and preparation of useful viral vectors and cell lines which 
support their construction and propagation - all with a decreased risk of recombining with 
wild-type Ad to produce potentially-harmful recombinants. 

In part, the present invention discloses a simpler, alternative means of reducing the 
recombination between viral and cellular sequences than those discussed in the art. One such 
means is to increase the size of the deletion in the recombinant virus and thereby reduce the 
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^..rhoAdosanesm 293 .*.«*.,— "•-««• -vel can Bncsof rhepmacn. 



invention. 



at least 95% homology therewith. 

^ amino acid residues described herein are P«f«ab>, * *. 1" • *- " 

group prcsoit at aidBO of a polypeptide. 

BN^jjo^ A nucleic acid having a preselect conserved nucleoride sequence 
mi a sequence encoding a preferred polypepdde according - P— ■»—■ 

Edna* m. — h — » »■* a DNA mo!ecul£ nM proet " to * 

orienmion and posinon a, rue counrcrparr DNA nrolecnic found h. — , I, 

nay also refer ro a DNA moleeole from snorher Ad serorype or from an cndrel, drfferent 
species - e.g. a human DNA sequence. 



taga: The rerms "penron" or >mon complex" art preferentially used harem lo 

penron base, as we,, as penron complex The meaning of rhe rem >nron" alone should be 
clear from the context within which it is used. 
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Poly peptide and Peptide : These terms are used interchangeably herein to designate a 
series of no more than about 50 amino acid residues connected one to the other by peptide 
bonds between the alpha-amino and carboxy groups of adjacent residues. 

Receptor : Receptor is a term used herein to indicate a biologically active molecule that 
specifically binds to (or with) other molecules. The term "receptor protein" may be used to 
more specifically indicate the proteinaceous nature of a specific receptor. 

TiaPS gepg or Th ^a pentic Nucl eotide Sequence: As described and claimed herein, such 
a sequence includes DNA and RNA sequences encoding an RNA or polypeptide. Such 
sequences may be "native" or naturally-derived sequences; they may also be "non-native" or 
"foreign" sequences which are naturally- or recombinantly-derived. The term "transgene 
which may be used interchangeably herein with the term "therapeutic nucleotide sequence," is 
often used to describe a heterologous or foreign (exogenous) gene that is carried by a viral 
vector and transduced into a host cell. 

Therefore, therapeutic nucleotide sequences include antisense sequences or nucleotide 
sequences which may be transcribed into antisense sequences. Therapeutic nucleotide 
sequences (or transgenes) further comprise sequences which function to produce a desired 
effect in the cell or cell nucleus into which said therapeutic sequences are delivered. For 
example, a therapeutic nucleotide sequence may encode a functional protein intended for 
delivery into a cell which is unable to produce that functional protein. 

Expression or Delivery Vector : Any plasmid or virus into which a foreign DNA may 
be inserted for expression in a suitable host cell -- i.e., the protein or polypeptide encoded by 
the DNA is synthesized in the host cell's system. Vectors capable of directing the expression 
of DNA segments (genes) encoding one or more proteins are referred to herein as "expression 
vectors". Also included are vectors which allow cloning of cDNA (complementary DNA) 
from mRNAs produced using reverse transcriptase. 

Adenoviral Vertnr or Ad-Derived Vector : Any adenovirus-derived plasmid or virus 
into which a foreign DNA may be inserted or expressed. This term may also be used 
interchangeably with "viral vector" This "type" of vector may be utilized to cany nucleotide 
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„ encoding dterapeuuc proteins or ponces ,o stifle ce,,s or ce,. W s in . 
snbiect in need of ^ent, as described further hereintelo*. 

osed . tnt^r^Toride seances fnso . pacing ce,, Une, whh me — of 

having said sequences sutbly inregrare into me cellular genome. 

tviJUn* Tbis.ennisgene.iiynsedbereinrodescribeap.asnr.dvecrord,, 

caI ries or debvera nucleotide sequences in or inro a ce,, Une (eg., a packaging ce„ .ine) for me 
purpose of propagaring rherapeuric virai vecrors of me present invention. 

The adenovirus ,Ad, particle ,s redely comp,ex and may be resolved into various 
„res. The our., she,, is suiting,, icosanedra, in abape and, 

, of 95 The structures at the fivefold positions ( pentons ) are 

have a triangu ation number of 23. inesirui-iui 

„ Tbus, rhe structure re* does nor correspond ro a simp,e sub-mangu aed 
iMs ahedra, desrgn. (See, e.g.. F.e,ds. or a... Stake. Vol. I. Raven Press. NY. pp. 54-56 
(1990).) 

Fiber plays a crucial role in adenovirus infecrion by anaching the virus to a specific 
^,0, en me ce„ surface. Tbe fiber consisu of tbrec domains-, an N-rermitral ui, mat 
im erecs vritb penton base; a abaft composed of 22 repeats of a ,5.amino-ac,d segment d, 
foms .bee. and -bends; and a hnob a, me C-temtinus ma, contains me type-spectfic anttgen 
* ,s responsible for binding ,0 me cell surface receptor. Tbe fiber proteinase respond* 
for uanspor. o, vita, nuCetc acds into me nucleus. The gene encoding .he fiber protetn from 
Afi has been expressed in human cells and has been shown ,0 be correctly assembled mm 
trim er, glycosylated and transported to the nucleus. (See. e.g.. Hong and Engler, Magy 
185 . 758-761 (1991).) Thus, alteration of gene dehvery mediated by recombinrut, adenovirus 
vectors to specific cell rypes has great u„,i,y for , vanety of gene therapy app.iea.ions and ,s 
thus one of the objects of the present invention. 

Hexon and penton capsomeres are the major component on .he surfuce of me vnon. 
Their consnmen, polypeptides, no, H. HI and IV, conuin tyrosine residues .ha, are exposed 
on ,he surface of me virion and can be labeled - e.g., by iodination of input, pantc.es. 
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The fiber is an elongated protein which exists as a trimer of three identical polypeptides 
(polypeptide IV) of 582 amino acids in length. The N-terminus of the fiber mediates binding to 
the penton base to form what is generally called the penton capsomere. The C-terminus of the 
fiber is involved in initial binding of the virus to cellular receptors. 

The 35,000+ base pair (bp) genome of adenovirus type 2 has been sequenced and the 
predicted amino acid sequences of the major coat proteins (hexon, fiber and penton base) have 
been described. (See, e.g., Neumann et al., Gene 69 : 153-157 (1988); Herisse et al., Nujl 
Acids Res. 9 : 4023-4041 (1981); Roberts et al., J. Bjol. Chem, 259: 13968-13975 (1984); 
Kinloch et al., T Rinl Chem. 259 : 6431-6436 (1984); and Chroboczek et al., V|rol, 161 : 
549-554 (1987).) 

The sequence of Ad5 DNA was completed more recently; its sequence includes a total 
of 35,935 bp. Portions of many other adenovirus genomes have also been sequenced. It is 
presently understood that the upper packaging limit for adenovirus virions is about 105% of 
the wild-type genome length. (See, e.g., Belt, et al., J. Virol. 67(10): 591 1-21 (1993).) Thus, 
for Ad2 and Ad5, this would be an upper packaging limit of about 38kb of DNA. 

Adenovirus DNA also includes inverted terminal repeat sequences (ITRs) ranging in 
size from about 100 to 150 bp, depending on the serotype. The inverted repeats enable single 
strands of viral DNA to circularize by base-pairing of their terminal sequences, and the 
resulting base-paired "panhandle" structures are thought to be important for replication of the 
viral DNA. 

For efficient packaging, the ITRs and the packaging signal (a few hundred bp in length) 
appear to comprise the "minimum requirement." Helper-dependent vectors lacking all viral 
ORFs but including these essential as elements (the ITRs and contiguous packaging sequence) 
have been constructed, but the virions package less efficiently that the helper and package as 
multimers part of the time, which suggests that the virus may "want" to package a fuller DNA 
complement (see, e.g.. Fisher, et al., Virology 217: 1 1-22 (1996). 
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Wide some prefer . - ******* Ad - vecrora for tar *. rcpUcauo, 
compcen, OT raise safe.y issue, rhe viral vecrors of rhe presen, invendon may rernm *- 
Z » Ure genome packaged whhin - U. drey may re* .heir rhey 

d0 „ M ac, as infecdous agenra .0 rhe exren. dm, .hey cause disease in ft. aubjeos ,0 whom 
they are adminisiered for iherapeuric purposes. 

„ is ,o be appeared dm, .he viral vecrors of dae pmsen, invent have several 
«nc. advances over adenovM and Ad-derived vecrors described in rhe m. For exampm. 
rec ombi„a.ion of such vecrors is rare; Acre are no known ~ns of human mahgnances 
.» adenoviral infecrions despire common human infecrion whh adenoviruses; me genome 
^ be manipulared ,0 accommodare foreign genes of a fairiy subsranrial size; and hos, 
proliferarion is no. required for expression of adenoviral pro.e,ns. 

An exrension of rhis invendon is dun rhe Ad-derived viral vecrors disclose herem may 
be used ro urge, and deliver genes m.0 specific cells by incorporaring *e armebmenr sequence 
for odrer receprors (such as CD4) on.o ,he fiber proreiu b, mcombinam DNA rechniques. mus 
producmg a chimeric mo,ecu.e. This should resul, in me ahillry ro mrge, and.dellver genes mm 
. wide range of cell rypes whh me advanrage of evading recognirion by me hosts immune 
sys.em The widain-disclosed delivery sys,ems drus provide for increased flexibilir, in gene 
design ro enable srable inregradon inro proliferaring and nonproliferaring cell rypes. 

For example, published Imemauonal App. No. W095/26412 and Krasnykh. e. d. 0. 
VirsUfl; 6839-46 (1996)), .he disclosures of wh.ch are incorporared by reference herem, 
describe mod.fica.ions ma, may be made ,0 me adenovirus fiber promin. Such mooificaUons 
arc usefu, in alrering dre mrgeung mechanism and specific!., of adenovirus and could readdy be 
udhred in conjunct wim .he consuls of .he presen. invenrion ,o rarge. ,he novel vrml 
vecors disclosed herein .0 differen, receprors and differen. cells. Moreover, modificarions ro 
fiber pr„,ein which alrer its uopism may permi. grearer conrrol over rhe loc.Haa.ion of v,ml 
vectors in therapeutic applications. 

Similarly, incorporation of various structural proteins into cell lines of the present 
invention, whether or not those proteins are modified, is also contemplated by the present 
invention. Thus, for example, modified penton base polypeptides such as those desenbed » 
Wl ckham, et al. (J. Virol- 70: 6831-8 (1996)) may have therapeutic utility when used according 
to the within-disclosed methods. 
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C. Packaging Cell Lines 

The first generation of recombinant adenoviral vectors currently available tend to have 
a deletion in the first viral early gene region which is generally referred to as El , which 
comprises the Ela and Elb regions. (These regions typically span genetic map units 1.30 to 
9.24.) Figure 3 in chapter 67 nf Fields Virology . 3d Ed. (Fields et al. (eds.), Lippincott-Raven 
Publ., Philadelphia, (1996), p. 2116) illustrates a transcription and translation map of 
adenovirus type 2 (Ad2) that is a helpful example. 

■ According to various published reports, deletion of the viral El region renders the 
recombinant adenovirus defective for replication and incapable of producing infectious viral 
particles in the subsequently-infected target cells. Thus, the ability to generate El -deleted 
adenovirus is often based on the availability of the human embryonic kidney packaging cell line 
called 293. This cell line contains the El region of adenovirus, which provides El gene region 
products to "support" the growth of El -deleted virus in the cell line (see, e.g., Graham et al., L 
Gen. Virol 36 : 59-7 1 ( 1 977)). 

- Nevertheless, the inherent problems with current first-generation recombinant 
adenoviruses have raised increasing concerns about their use in patients. For example, several 
recent studies have shown that El -deleted adenoviruses are not completely replication- 
incompetent (see Rich, Hum. Gene. Ther. 4 : 461-476 (1993); Engelhardt, et al., Nature Genet. 
4:27-34(1993)). 

Three general limitations are associated with the adenoviral vector technology. First, 
infection both in vivo and in vitro with the adenoviral vector at high multiplicity of infection 
("MOI") has resulted in cytotoxicity to the target cells, due to the accumulation of penton 
protein, which is itself toxic to mammalian cells (Kay, Cell Biochem. 17E; 207 (1993)). 
Second, host immune responses against adenoviral late gene products, including penton 
protein, cause the inflammatory response and destruction of the infected tissue which received 
the vectors (Yang, et al., PNAS USA 92 : 4407-441 1 (1994)). Lastly, host immune responses 
and cytotoxic effects together prevent the long-term expression of transgenes and cause 
decreased levels of gene expression following subsequent administration of adenoviral vectors 
(Mittal, et al., Virus Res. 28 : 67-90 (1993)). 
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The packaging cell lines disclosed herein suppon viral vectors with deletions of major 
portions of the viral genome, without the need for helper viruses. 
D . Therapeutic Vjraj Vectors and Rf.lated System s 

1 Nnrlpit! Acid Sepments 

A therapeutic viral vector or composition of the present invention comprises a 
nucleotide sequence encoding a protein or polypeptide molecule - or a biologically acUve 
fragment thereof - which may be used for therapeutic applications, as described herem. 
therapeutic viral vector or composition may further comprise an enhancer element or a 
promoter located 5' to and controlling the expression of such a therapeutic nucleotide seouence 

or gene. . 
In general, promoters are DNA segments that contain a DNA sequence that controls 

the expression of a gene located 3' or downstream of the promote, The promoter is the DNA 
sequence to which RNA polymerase specifically binds and initiates RNA synthesis 
(transcription) of that gene, typically located 3' of the promote, If more than one nucleic actd 
sequence encoding a panicular polypeptide or protein is included in a therapeutic viral vector 
or nucleotide sequence, more than one promoter or enhancer element may be included, 
particularly if that would enhance efficiency of expression. For purposes of the present 
invention, regulable (inducible) as well as constitutive promoters may be used, either on 
separate vectors or on the same vector. 

A subject therapeutic nucleotide composition or vector consists of a nucleic acid 
molecule that comprises at least 2 different operative.y linked DNA segments. The DNA can 
be manipulated and amplified by PCR as described herein and by using standard techmques, 
such as those described in Msl-kLi^^ Sambr °° k * 

al eds Cold Spring Harbor, New York (1989). Typically, to produce a therapeutic v,ral 
vector of the present invention, the sequence encoding the selected therapeutic composition 
and the promoter or enhancer are operatively linked to a DNA molecule capable of 
autonomous replication in a cell either in vivo or in vitro. By operatively linking the enhancer 
element or promoter and the nucleotide sequence encoding the therapeutic nucleotide 
composition to the vector, the attached segments are replicated along with the vector 
sequences. 



WO 98/13499 



-23- 



PCT/EP97/05251 



Thus, a recombinant DNA molecule (rDNA) of the present invention is a hybrid DNA 
molecule comprising at least 2 nucleotide sequences not normally found together in nature. In 
various preferred embodiments, one of the sequences is a sequence encoding an Ad-derived 
polypeptide, protein, or fragment thereof. Stated another way, a therapeutic nucleotide 
sequence of the present invention is one that encodes an expressible protein, polypeptide or 
fragment thereof, and it may further include an active constitutive or regulatable (e.g. 
inducible) promoter sequence. 

A therapeutic viral vector or composition of the present invention is optimally from 
about 20 base pairs to about 40,000 base pairs in length. Preferably the nucleic acid molecule 
is from about 50 bp to about 38,000 bp in length. In various embodiments, the nucleic acid 
molecule is of sufficient length to encode one or more adenovirus proteins or functional 
polypeptide portions thereof. Since individual Ad polypeptides vary in length from about 19 
amino acid residues to about 967 amino acid residues, corresponding nucleotide sequences will 
range from about 50 bp up to about 3000 bp, depending on the size and of individual 
polypeptide-encoding sequences that are "replaced" in the viral vectors by therapeutic 
nucleotide sequences of the present invention. 

Various Ad proteins are comprised of more than one polypeptide sequence. Thus, 
deletion of the corresponding genes from an Ad vector as taught herein will thus allow the 
vector to accommodate even larger "foreign" DNA segments. Thus, if the sequences encoding 
■ one or more adenovirus polypeptides or proteins are supplanted by a recombinant nucleotide 
sequence of the present invention, the length of the recombinant sequence can conceivably 
extend nearly to the packaging limit of the relevant adeno virus-derived vector. 

In view of the fact that preferred embodiments disclosed herein are helper-independent 
Ad-derived vectors, the entire wild-type Ad genome cannot be completely supplanted by 
. recombinant nucleic acid molecules without transforming such a vector into a vector requiring 
"help" of some kind. However, the Ad-derived vectors of the present invention do not depend 
on a helper virus; instead, the vectors of the present invention are propagated in cell lines 
stably expressing proteins or polypeptides that have been removed from said vectors to allow 
the addition of "foreign" DNA into the vectors. In various disclosed embodiments, specific 
early region and structural polypeptides are deleted from the vectors of the present invention, 
thereby enabling the vectors to accommodate recombinant nucleic acid sequences (or 
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casseues) of various length, For example, Ad-derived vectors of the present invention may 
easily include 12 kb or more of foreign (or "therapeutic") DNA sequences. 

The therapeutic (or foreign) nucleotide sequence can be a gene or gene fragment that 
encodes a protein or polypeptide - or a biologically active fragment thereof - that proves a 
desired therapeutic effect such as replacement of alpha 1 -antitrypsin or cystic fibres* 
transmembrane conductance regulator protein (CFTR) and the like. (See, e.g., Crystal, et al., 
H^Bfiacto* 42-51 (1994); Zabner, et al., 0*75: 207-216 (1993); Knowles, et al., 
j^^Oi): 823-831 (1995); and Rosenfeld, et al., CeJL6J: 143-155 (1992), the 
disclosures of which are incorporated by reference herein.) 

An Ad-derived vector of the present invention may also comprise a nucleoude 
seqU ence encoding a protein, polypeptide or fragment thereof that is effective in regulating the 
cell cycle - such as P 53, Rb, or mitosin - or which is effective in inducing cell death, such as 
thymidine kinase. (See, e.g., published International App. No. WO 95/1 1984, the disclosures 
of which are incorporated by reference herein.) It is further contemplated that a therapeuuc 
protein or polypeptide expressed by a therapeutic viral vector of the present invention may be 
used in conjunction with another therapeutic agent when appropriate - e.g., a thymidme kmase 
m etabolite may be used in conjunction with the gene encoding thymidine kinase and its gene 
product - in order to be even more effective. 

Alternatively, a therapeutic viral vector can include a DNA or RNA oligonucleotide 
sequence that exhibits enzymatic therapeutic activity without needing to be translated into a 
polypeptide product before exerting a therapeutic effect. Examples of the latter include 
antisense oligonucleotides that will inhibit the transcription of deleterious genes or ribozymes 
that act as site-specific ribonucleases for cleaving selected mutated gene sequences. In another 
variation, a therapeutic nucleotide sequence of the present invention may compnse aDNA 
construct capable of generating therapeutic nucleotide molecules, including ribozymes and 
antisense DNA, in high copy numbers in target cells, as described in published PCT application 
No WO 92/06693 (the disclosure of which is incorporated herein by reference). Other 
preferred therapeutic nucleotide sequences according to the present invention are capable of 
delivering HIV antisense nucleotides to latently-infected T cells via CD4. Similarly, delivery of 
Epstein-Barr Virus (EBV) EBNa- 1 antisense nucleotides to B cells via CR2 is capable of 
effecting therapeutic results. 
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As noted elsewhere herein, an Ad-derived vector of the present invention may also 
include a promoter sequence. Both constitutive and regulatable (often called "inducible") 
promoters are useful in constructs and methods of the present invention. For example, some 
useful regulatable promoters are those of the CREB-regulated gene family and include - and - 
inhibin, - gonadotropin, cytochrome c, glucagon, and the like. (See, e.g., published 
International App. No. WO96/14061, the disclosures of which are incorporated by reference 
herein.) 

A regulatable or inducible promoter may be described as a promoter wherein the rate of 
RNA polymerase binding and initiation is modulated by external stimuli. Such stimuli include 
various compounds or compositions, light, heat, stress, chemical energy sources, and the like. 
Inducible, suppressive and repressive promoters are considered regulatable promoters. 

Regulatable promoters may also include tissue-specific promoters. Tissue-specific 
promoters direct the expression of the gene to which they are operably linked to a specific cell 
type. Tissue-specific promoters cause the gene located 3' of it to be expressed predominantly, 
if not exclusively, in the specific cells where the promoter expressed its endogenous gene. 
Typically, it appears that if a tissue-specific promoter expresses the gene located 3' of it at all, 
then it is expressed appropriately in the correct cell types (see, e.g., Palmiter et al., Ann. Rev. 
Genet. 20 : 465-499 (1986)). 

When a tissue-specific promoter controls the expression of a gene, that gene will be 
expressed in a small number of tissues or cell types rather than in substantially all tissues and 
cell types. Examples of tissue-specific promoters include the immunoglobulin promoter 
described by Brinster et al., Nature 306 : 332-336 (1983) and Storb et al., Nature 310: 238-231 
(1984); the elastase-I promoter described by Swift et al., Cell 38 : 639-646 (1984); the globin 
promoter described by Townes et al., Mol. Cell. Biol. 5 : 1977-1983 (1985), and Magram et 
al., Mol. Cell. Biol. 9 : 4581-4584 (1989), the insulin promoter described by Bucchini et al., 
PNAS USA. 83 : 251 1-2515 (1986) and Edwards et al., Cell 58 : 161 (1989); the 
immunoglobulin promoter described by Ruscon et al., Nature 314: 330-334 (1985) and 
Grosscheld et al., Cell 38 : 647-658 (1984); the alpha actin promoter described by Shani, MoL 
Cell Biol. 6 : 2624-2631 (1986); the alpha crystalline promoter described by Overbeek et al., 
PNAS USA 82 : 7815-7819 (1985); the prolactin promoter described by Crenshaw et al., 
Genes and Development 3 : 959-972 (1989); the proopiomelanocortin. promoter described by 



PCT/EP97/05251 

WO 98/13499 

-26- 

, pnas USA 85- 8890-8894 (1988); the bm-lhyroid sumulating honnone 
Tremblay el ah. EIW "™ " „. . .,.„„ fl9S9); ihe 

(BTSH) proper described * Ta.su™ « a!., Kisp^Uia^. * 
(B ' P . vilus (MMT V) promoter described by Muller et al.. EdU4- ' "5 

(WW); dte album*, promoter desenbeo y 

(1986); the keratin promoter described by Vassal et al., O^SJ^SAX. 

, e ostconeerin promoter described by McVey et a.., LM^m^- .U . . 

a988) ; me proscate-specificpromoter described by Allison e, a. M^L^ 
,54-2257 OM me opsin promoter described by Nathans « al BUUBUL 
1). tbe olfactory marker protein promote, described b, Danctger a, al- EISALl^ 
989,^0 nemon-speefflc anolase (NSE) promoter described by Foster e, 

ZZ&Z^m -d d-e protandne , promos described Pesehon e. - 
^^M; ,86-197 (.989, and Braun e, al., o.^,*]**^ ^ 
^e^j^Thetfoclosures of all references cited are incorporated by reference herein.) 
2 Propositions 

ln various alternative embodiments of me present invention, tberapenuc seances *td 
Virions useful for practicing the therapeutic memo* described herein are contended. 
■ v™, of the nresent invention may contain a physiologtcally tolerable 

™nic or omerwise able to cause undesirable side effects when admmtsrered ,0 a 

subject for therapeutic purposes. ,„u„ W a,» 
As „sed herein, me - •■pharmaceutic* acceptable", "phystologtcally tolerable 
and erammatica, variations dtereof. as they refer to composidons. earriera, * uents and 

reag e„,s are used inrerchangeably and represent that me tnarerials are capable of 

reagents, ar a ^mal - without the production of undesrreble 

administration to or upon a subject - e.g., a mamma; 

physiological effects such as nausea, dtainess, gastric upset and the hire. 

Po, example, rhe presen, invention comprises therapeutic commons useful ,n the 

Tide Uce to -se «* Thetapeuric compositions designed ,o preferenually targe. 
10 epithelial ce„s may comprise an adenovims-derived veCor inching a therapeunc 
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nucleotide sequence. As described herein, a number of adenovirus-derived moieties are useful 
in the presently-disclosed therapeutic compositions and methods. 

While some of the Examples appearing below specifically recite fiber proteins, 
polypeptides, and fragments thereof, it is expressly provided herein that other structural and 
non-structural Ad proteins and polypeptides (e.g., regulatory protein s and polypeptides) may 
be used as components of the various disclosed vectors and cell lines. Moreover, chimeric 
molecules comprised of proteins, polypeptides, and/or fragments thereof which are derived 
from different Ad serotypes may be used in any of the within-disclosed methods, constructs 
and compositions. Similarly, recombinant DNA sequences of the present invention may be 
prepared using nucleic acid sequences derived from different Ad serotypes, in order to design 
useful constructs with broad applicability, as disclosed and claimed herein. 

It should also be appreciated that, while the members of Group C adenovirus - i.e., Ad 
serotypes 1, 2, 5, and 6 - are specifically recited in various examples herein, the present 
invention is in no way limited to those serotypes alone. In view of the fact that the adenovirus 
serotypes are all closely-related in structure and functionality, therapeutic viral vectors, 
packaging cell lines, and plasmids of the present invention may be constructed from 
components of any and all Ad serotypes - and the within-disclosed methods of making and 
using the various constructs and cell lines of the present invention apply to all of said 
serotypes. 

The preparation of a pharmacological composition that contains active ingredients 
dissolved or dispersed therein is well understood in the art. Typically such compositions are 
prepared as injectables -- either as liquid solutions or suspensions - however, solid forms 
suitable for solution or suspension in liquid prior to use can also be prepared. A preparation 
can also be emulsified, or formulated into suppositories, ointments, creams, dermal patches, or 
the like, depending on the desired route of administration. 

The active ingredient can be mixed with excipients which are pharmaceutically 
acceptable and compatible with the active ingredient and in amounts suitable for use in the 
therapeutic methods described herein. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol or the like and combinations thereof, including vegetable oils, 
propylene glycol, polyethylene glycol and benzyl alcohol (for injection or liquid preparations); 
and petrolatum (e.g., VASELINE), vegetable oil, animal fat and polyethylene glycol (for 
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o»ny p"*— * ta i,dBirci composi " on " T r 

an.top.ic a 6 ems, sootiting agon* and ft. like additives (as usual auxitay addles ,0 

— — - — 

— satis of me coroponems .heroin. PhannacuUca,,, acceptable sal* include the acd 
Jon satis (fonned wi,h ti* to aroino groups of the polypeptide, that are tamed w„h 
^ric acds sua,, as. fo. example, hydrochloric 0. phosphoric acid, - - «* 
„ ale. .artaric. .and* and the tike. S»,,s fonmed wid, the froe carboxy, groups can , so 
b e denved from ,no, g anic bases such as, fo, examp,e. sodium. P—, _ a^n 
o, U hydroxides, and such org** bases as isop.opy,amine, ui m cd,y,am,n=. l-ethylammo 
cdianol.hislidine. procaine and die like. 

Physiologically .olcable carriers aro well known in te art. Exemplary of hqu,d 
carriers arc s,erile aqueous soludons ma. contain no roarorials in addirion .0 the ac„ve _ 
ingmdiems and wa,e,. o, comain a huffe, such as sodiuro phospha,c a, physiolog,cal pH value, 
philological saline o, hod,, such as phos P ha.e-h„fferod saline. Still furihe, aqueous came. 
1 con,ain rooro .han one huffe. sal, as well as saUs such as sc4h.ro and po.ass,uro chlondes, 
dexirose, polyethylene glycol and other soluros. 

Liquid conditions can also comain »,uid phases in addi.ion . and ro tine exc.us.on 
„, waror. Exemplary of such additional liquid phases aro glycerin, vegeuble oris such as 
collonseed oil. and water-oil emulsions. 

A uteropeuric coroposHion .ypically comains an arooun, of a roe.apeu.io nuclide 
sequence 0, Ore p,esen, invention sufficien, .o delive, a ,he,pe„rica„y effeorive arooun, ro u,e 
.issue, .ypically an arooun, of a, leas, 0.1 weigh, pe.<*n, ,0 abou, 90 weigh, percent of 
^peuric nucleotide sequence pe, weigh, of ,o,al mcapeuuc coroposHion. A wcgh. percent 
isamiohy weigh, of *e,ap.u,ionucleo,ide sequence, o,o.alcoroposirion. Thus,fo, 
exarople.0.1 weigh, percen. is 0.1 grams of DNA segment pet ,00 grams of total compose. 

The the.apeu.ic nucleotide compositions coroprising symhetic oligonucleoude 
sequences of roe p.esen, invention can he prcpared usrng any sutiable roethod, such as, roe 
phospho.nes.e. o. phosphodi=s,e, methods. See Narang e. a,.. MaUagffllsA. 68:90. 
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(1979); U.S. Patent No. 4,356,270; and Brown et aL, Me.th. EnzvmoL 68:109, (1979), the 
disclosures of which are incorporated by reference herein. 

For therapeutic oligonucleotides sequence compositions in which a family of variants is 
preferred, the synthesis of the family members can be conducted simultaneously in a single 
reaction vessel, or can be synthesized independently and later admixed in preselected molar 
ratios. For simultaneous synthesis, the nucleotide residues that are conserved at preselected 
positions of the sequence of the family member can be introduced in a chemical synthesis 
protocol simultaneously to the variants by the addition of a single preselected nucleotide, 
precursor to the solid phase oligonucleotide reaction admixture when that position number of 
the oligonucleotide is being chemically added to the growing oligonucleotide polymer. The 
addition of nucleotide residues to those positions in the sequence that vary can be introduced 
simultaneously by the addition of amounts, preferably equimolar amounts, of multiple 
preselected nucleotide precursors to the solid phase oligonucleotide reaction admixture during 
chemical synthesis. For example, where all four possible natural nucleotides (A,T,G and C) are 
to be added at a preselected position, their precursors are added to the oligonucleotide 
synthesis reaction at that step to simultaneously form four variants. 

This manner of simultaneous synthesis of a family of related oligonucleotides has been 
previously described for the preparation of "Degenerate Oligonucleotides" by Ausubel et al. 
( Current Protocols in Molecular Biology . Suppl. 8. p.2.1 1.7, John Wiley & Sons, Inc., New 
York (1991)), and can readily be applied to the preparation of the therapeutic oligonucleotide 
compositions described herein. 

Nucleotide bases other than the common four nucleotides (A,T,G or C), or the RNA 
equivalent nucleotide uracil (U), can also be used in the present invention. For example, it is 
well known that inosine (I) is capable of hybridizing with A, T and G, but not C. Examples of 
other useful nucleotide analogs are known in the art; many may be found listed in 37 C.F.R. 
§1.822. 

Thus, where all four common nucleotides are to occupy a single position of a family of 
oligonucleotides, that is, where the preselected therapeutic nucleotide composition is designed 
to contain oligonucleotides that can hybridize to four sequences that vary at one position, 
several different oligonucleotide structures are contemplated. The composition can contain 
four members, where a preselected position contains A,T,G or C. Alternatively, the 
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com posi,,on can contain rwo -te «■» • F— — P**" < 0MainS 1 °< <;""" 
the capacity the hybrtdine at Oat position ,o all fou, possible common nucleosides. Ftnally. 
other nucleotides may be included a. She preselected posiuon mat have d* capacuy .0 
hvbri diae in a non-destabilizing manner with more man one of me common nucleoddes m a 
manner similar to inosine. 

3 ExeiegsjfiE V^ tnr Systems 

The introduction of exogenous DNA into eucaryotic cells has become one of the most 
powerful tools of the molecular biologist. The term "exogenous" encompasses any therapeuuc 
composition of this invention which is administered by the therapeuuc methods of thts 
invention. Thus, "exogenous" may also be referred to herein as "foreign," "non-native," and 
the like The methods of this invention preferably require efficient delivery of the DNA mto 
the nucleus of the recipient cell and subsequent identification of cells that are expressing the 
foreign DNA. 

A widely-used plasmid is pBR322, a vector whose nucleotide sequence and 
endonuclease cleavage sites are well known. Various other useful plasmid vectors are 
described in the Examples that follow. 

A vector of the present invention comprises a nucleic acid (preferably DNA) molecule 
capable of autonomous replication in a cell and to which a DNA segment, e.g., a gene or 
polynucleotide, can be operatively linked so as to bring about replication of the attached 
segment In the present invention, one of the nucleotide segments to be operatively linked to 
vector sequences encodes at least a portion of a therapeutic nucleic acid molecule - » effect, a 
nucleic acid sequence that encodes one or more therapeutic proteins or polypeptides, or 
fragments thereof. 

In various embodiments, the entire peptide-coding sequence of the therapeutic gene « 
inserted into the vector and expressed; however, it is also feasible to construct a vector which 
also includes some non-coding sequences as well. Preferably, however, non-coding sequences 
are excluded. Alternatively, a nucleotide sequence for a soluble form of a polypeptide may be 
utilized. Another preferred therapeutic viral vector includes a nucleotide sequence encoding at 
,east a portion of a therapeutic nucleotide sequence operatively linked to the vector for 
expression. As used herein with regard to DNA sequences or segments, the phrase 
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"operatively linked" generally means the sequences or segments have been covalently joined 
into one piece of DNA, whether in single or double stranded form. 

The choice of viral vector into which a therapeutic nucleotide sequence of this 
invention is operatively linked depends directly, as is well known in the art, on the functional 
properties desired, e.g., vector replication and protein expression, and the host cell to be 
transformed ~ these being limitations inherent in the art of constructing recombinant DNA 
molecules. Although certain adenovirus serotypes are recited herein in the form of specific 
examples, it should be understood that the present invention contemplates the use of any 
adenovirus serotype, including hybrids and derivatives thereof. As one will observe, it is not 
unusual or outside the scope of the present invention to utilize nucleotide and/or amino acid 
residue sequences of two or more serotypes in constructs, compositions and methods of the 
invention. 

As one of skill in the an will note, in various embodiments of the present invention, 
different "types" of vectors are disclosed. For example, one "type" of vector is used to deliver 
particular nucleotide sequences into a packaging cell line, with the intent of having said 
sequences stably integrate into the cellular genome; these "types" of vectors are generally 
identified herein as complementing plasmids. A further "type" of vector described herein 
carries or delivers nucleotide sequences in or into a cell line (e.g., a packaging cell line) for the 
purpose of propagating therapeutic viral vectors of the present invention; hence, these vectors 
are generally referred to herein as delivery plasmids. A third "type" of vector described herein 
is utilized to carry nucleotide sequences encoding therapeutic proteins or polypeptides to 
specific cells or cell types in a subject in need of treatment; these vectors are generally 
identified herein as therapeutic viral vectors or Ad-derived vectors. 

In one embodiment, the directional ligation means is provided by nucleotides present in 
the upstream nucleotide sequence, downstream nucleotide sequence, or both. In another 
embodiment, the sequence of nucleotides adapted for directional ligation comprises a sequence 
of nucleotides that defines multiple directional cloning means. Where the sequence of 
nucleotides adapted for directional ligation defines numerous restriction sites, it is referred to 
as a multiple cloning site. 

A translatable nucleotide sequence is a linear series of nucleotides that provide an 
uninterrupted series of at least 8 codons that encode a polypeptide in one reading frame. 
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sniu b,e ,ype. such as a viral vecor (RNA or DNA,, naked ^M- or cedar DNA. or 
vesi c,e or envelope — g d> nucleic acid maurial aod any polypepudes fta. are .0 ba 

inserted into the cell. . 

A preferred viral vecor i„ which rhcnrpe.dc nucleoride composirions of rhis mvennon 

« presen, is derived from adenovirus (Ad). 1. is also desirable d,a, t. vecor comam « 
poorer sequence. As raugh, herein, viral vecors of .his invendon nray be des,goed and 
consumed in such a way die, drey specific* urge, a preset recipien, cell .yps. 
pending on dre nana, of drerapy one seeks ,0 ad^nUrer. Meihods of makdng an usmg 
jmpc.be vira, veorors dra, rargei specific cel. are furdner described in ft. Examples ft. 



follow 



Novel vecrors and commons may also be designed end prepamd ro prefemnrially 
rarger cells iha, mrghr no, ofterwis. be uugced by wild-rype adenovirus virion, For example. 
a o*er ,0 urge, no,ep„helial celU, one following fte reaching* of dxe prexen, specficanon 
may be able » prepare a fterapeudc vecor including a nueleoude sequence encoding a foreign 
P ,o,ein. polypepdde o, odrer ligand diraced ,0 a non-epirhelia, cel. 0, ,0 a different 
L ,ha, generally urgced by a paniclax adenovim, Examp,es of useful ligandsdueced „ 
specific racepiors (idendfied in parendreses) include dre V3 loop of HIV gp.20 (CD4); 
rmnsfemn (uansferrin recepror); LDL (LDL recep.ors); and deglycosylared P ro,e,na 
(asia ,„g,ycoprorein recepror). Various useful ligands which may be added ,0 adenovnus fiber - 
. a „d mchods for pmparrng and anaching same - are se, form in published Imenmional App. 
No WO05/26412, tte disclosures of which are inco.pora.ed by reference hecn. 

Useful Uganda which may be encoded by a foreign nueleoude sequence contained 
wdhin a viral vecor of ,bc prescn, invendon, or wh,ch may he linked ro proreins or 
polypepudes expressed .hereby after said vacrors axe adminisured ,0 a subjec, also mdude 
a„,ibod,es and a,uchm=n, sequancas, ax well as receprors dremselvea. For example, an„bod,es 
,„ cell reccpm, molecules such ax inregrina and dre Hire, MHC Class I and Class H, 
axialoglycoprorein racepror, rransfe™ recepmrs, LDL recepror, CD4, and CR2 are bo. a few 
which am useful according «o the presen, invendon. I, is also undersrood fta. the ligands 
.ypically bound by recepror, as wall as analogs .o drose Uganda, may be used as cellular 
targeting agents, as disclosed herein. 
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E. Therapeutic Methods 

The vectors of the present invention are particularly suited for gene therapy. Thus, 
various therapeutic methods are contemplated by the present invention. 

For example, it has now been discovered that Ad-derived viral vectors are capable of 
delivering a therapeutic nucleotide sequence to a specific cell or tissue, thereby expanding and 
enhancing treatment options available in numerous conditions in which more conventional 
therapies are of limited efficacy. Accordingly, methods of gene therapy utilizing these vectors 
are within the scope of the invention. Vectors are typically purified and then an effective 
amount is administered in vivo or ex vivo into the subject. 

For example, the compositions may be used prophylactically or therapeutically in vivo 
to disrupt HTV infection and mechanisms of action by inhibiting gene expression or activation, 
via delivery of antisense HIV sequences or ribozymes to T cells or monocytes. Using methods 
of the present invention, one may target therapeutic viral vectors as disclosed herein to specific 
cells and tissues, including hematopoietic cells, as infection of such cells appears to be 
mediated by distinct integrins to which viral vectors of the present invention may readily be 
targeted. (See, e.g., Huang, et al., J. Virol. 70 : 4502-8 (1996).) 

Other useful therapeutic nucleotide sequences include antisense nucleotide sequences 
complementary to EBV EBNa-1 gene. Use of such therapeutic sequences may remediate or 
prevent latent infection of B cells with EBV. As discussed herein and in the Examples below, 
targeting and delivery may be accomplished via the use of various ligands, receptors, and other 
appropriate targeting agents. 

Thus, in one embodiment, a therapeutic method of the present invention comprises 
contacting the cells of a subject infected with EBV or HIV with a therapeutically effective 
amount of a pharmaceutically acceptable composition comprising a therapeutic nucleotide 
sequence of this invention. In a related embodiment, the contacting involves introducing the 
therapeutic nucleotide sequence composition into cells having an EBV or HTV-mediated 
infection. 

Methods of gene therapy are well known in the art (see, e.g., Larrick and Burck, Gene 
Therapy: Application of Molecular Biology . Elsevier Science Publ. Co., Inc., New York, NY 
(1991); Kriegler, One Transfer and Expression - A 1 ahnratorv Manual. W. H. Freeman and 
Company, New York (1990)). The term "subject" should be understood to include any animal 
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r™ natient such as any murine, rat, bovine, porcine, canine, feline, 
- particularly mammalian -- patient, sucn « j 

equine, ursine, or human patient. 

When the foreign gene carried in the vector encode, a rumor suppressor gene or 
M0 ,her anti-tumor prorein. the vecror is usefir, >„ treat or rednce ^ =* " ' 
subjKt , .0 inhihi. « pro.ifcra.ion in a subject or ro anrdiorare a parrrcuiar. tulared 
S*. Padroiogic hyperproliferarive ceiis - characteristic of various disease states^eh 

:in 8 jl «— . — - - — b,add " — ■ co,on cance " ,mg 

cancer various leukemias, and lymphomas. 

' Non-parhologic hype^iifemrivc cc„s are foond, for example, in ce„s assured wad, 
wound repair. Padtologic hypetproUferative cells, however, characteristically exhibit loss o 
eontact inhibition and a deehne in .heir ah* .0 selective,, adhere which impi.es a change ,n 
,h. surface properties of rhe cell and a firnher breakdown in intereeMa, communtcatton. 
These changes include stimulation ,o divide and the ability .0 seoete proreolyric enzymes. 
The present invenrion also conrempla.es methods of depledng suitable samples of 

L „. reconsdrurion via fte tntroduction of a w„d,ype tumor suppressor gene ,n o dte 
preparation using a vector of dns .nvendon. As used herein, a suirable sample . : defined as 
a heterogeneous cel, preparation obtain* from a parien, e.g.. a mixed populatton of coda 
containing both phenotypically normal and pathogenic cells. 

linistration includes - bu, is no, limited to - dte induction o, therapeutic agents 
„, One present .nvendon into a cell or subject via vanous means, including dime injection. 

Tnerapeutic agents as disclosed herein may also be combined fo, administration of an effeouve 
amount of dte agents with a pharmaceutically-acceptable carrier, a, described herent. 

AS used herein, "effective amount" generally means the amount of vector (or proretns 
produced/released thereby) which achieves a positive outcome in the subject to whom the 
vector is administered. The tot* volume admtnistered wil! necessarily v*y depending on fte 
«* of administrarion, as those of skill in the relevant an wi>l appreoiare, and dosages may 
vary as well. 
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The dose of a biologic vector is somewhat complex and may be described in terms of 
the concentration (in plaque-forming units per milliliter (pfu/ml)), the total dose (in pfus), and 
the estimated number of vectors administered per cell (the estimated multiplicity of infection or 
MOI). Thus, if a vector is administered via infusion - say, across nasal epithelium - at a 
constant total volume, the respective concentration, etc. may be described as follows: 



Concentration Volume Dose Estimated 

(pfu/ml) (mD tefii! MOI 

107 2 2xl0 7 1 

10 8 2 2 x 10 8 10 

10 9 2 2 x 10 9 100 
IQlO 2 2xl0 10 1000 



In general, when adenoviral vectors are administered via infusion across the nasal 
epithelium, administered amounts producing an estimated MOI of about 10 or greater are 
much more effective than lower dosages. (See, e.g., Knowles, et al., New Eng. J. Med. 333 : 
823-831 (1995).) Similarly, when direct injection is the preferred treatment modality - e.g., 
direct injection of a viral vector into a tumor - doses of 1 x 10 9 pfu or greater are generally 
preferred. (See. e.g., published International App. No. W095/1 1984.) 

Thus, depending on the mode of administration, an effective amount administered in a 
single dose preferably contains from about 10 6 to about 10 15 infectious units. A typical 
course of treatment would be one such dose per day over a period of five days. As those of 
skill in the art will appreciate, an effective amount may vary depending on (1) the pathology or 
other condition to be treated, (2) the status and sensitivity of the patient, and (3) various other 
factors well known to those of skill in the art, such as the patient's tolerance to other courses 
of treatment that may have been applied previously. Thus, those of skill in the art may easily 
and precisely determine effective amounts of the agents/vectors of the present invention which 
may be administered to a particular patient, based on their understanding of and evaluation of 
such factors. 
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The presen, invention also coniemplares »** ° f ™"° rati " S f^"' 08 * . 
hsraaen Z by hypen-ro.ifcra.ive cells or generic defects in a subject, by ads— * .0 tine 

— -r^r" 

J ig n gene encoding a gene produc, (.,. poiypep.de or prorein, having rhe abihry .0 
111 the pa-hoiogy. under suiuble condition, As used herein. .he .em. "genenc defec, 
^ my disease, condition or abnormal which m*. ta. inherit facto*, e.g. 
Hundngron s Dise^e. Tay-Sachs Disuse, or Sickle Cell Disease. 

The presen, in.en.ion former provides methods for reducing .he proHfcrauon of rumor 
«„s in a subjec. by inducing inro the uimor mass an effective antoun. of an adenov.^ 
expression vecror conning an an-rumor gen. otiner rhan a rumor suppressor gene. Th 
X.or.enecaoencode.forexanrple.dhyn.dine^CTX). An effeetive antoun, of a 
Ilpeuric agent - d.en administered ,o .h. subject rhe dterapeutic agent, in the presence of 

.he ami-rumor gene, is toxic to die cell. 

Using thymidine kinase as exemplruy, me merapeudc agent is a thynudme kmase 
m e,abo,i,c such as ganciclovir (GCV). 6-merhoxypuhne arabinonucleoside (araM). or a 
bnctional equivalent .hereof. Bom dte thymidine kinase gene and ,he rhymidine kinase 
mlab o,i,e mus, be used concu m n.,y in order to exen a toxic effect on .hehos, =e . In me 
presence of .he TK gene. GCV is phosphorylared and becomes a poten, inhtbuor of DNA 
ynlhesis. whereas araM is converted ,o rhe cyromxic anaholire araATP. Thus, .he P ree,se 
me ,hod of acrion or synergism is no, relevant to therapeutic efficacy; what is relevant* me 
to that me concu.cn. use of appropriate genes and therapeutic agent, may effecuvely 
ameliorate a specific disease condition. 

Another usefu, example con,empla.es use of a vector of the presen. invention wh,ch 
expresses die enxyme cytosine deaminase. Such a vecror could he used in nonaction wid, 
Imisrration of dte dmg S-ffuorouraci, (Austin and Hube, ™™ ««» 

ortherecentiy-describedE. coliDeo gene in combination wid, 6.tnemyl-punne-2 • 
deoxyribonucleoside (Sorsoher e. al, GjniltergBai 233-238 (1994)). 

M with .he use of the rumor suppressor genes described previously, the use of orher 
anti-rumor genes, ether alone or in combination with the appropriate .herapeutio agent, 
piovides a trearmen, fo, dte uncon.rolied eel, grow* or proliferation chaxacteristic o tumors 
and mahanancies. Thus, me presen, invention provides therapies ,0 M <ne uncontrolled 
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cellular growth in a patient, thereby alleviating the symptoms or the disease or cachexia present 
in the patient. The effect of this treatment includes, but is not limited to, prolonged survival 
time of the patient, reduction in tumor mass or burden, apoptosis of tumor cells, or the 
reduction in the number of circulating tumor cells. Means of quantifying the beneficial effects 
of this therapy are well known to those of skill in the art. 

The present invention provides a recombinant adenovirus expression vector 
characterized by the partial or total deletion of one or more adenoviral structural protein genes, 
such as the gene encoding fiber, which allows the vector to accommodate a therapeutic, 
foreign nucleic acid sequence encoding a functional foreign polypeptide, protein, or 
biologically active fragment thereof. For example, such a functional polypeptide moiety may 
be a suicide gene or a functional equivalent thereof, of which the anti-cancer gene TK is but 
one example. TK genes, when expressed, produce a gene product which is lethal to the cell, 
particularly in the presence of GCV. One source of the TK gene is the herpes simplex virus 
(HSV), albeit other sources are known as well and may be used as taught herein. The TK gene 
may readily be obtained from HSV by methods well known to those of skill in the art. For 
example, the plasmid pMLBKTK in E. coli HB101 (from ATCC #39369) is a source of the 
HSV- 1 TK gene, which may be used as disclosed herein. (See, e.g. published International 
application No. WO 95/1 1984, the disclosures of which are incorporated by reference herein.) 

A therapeutic gene sequence may be introduced into a tumor mass by combining the 
adenoviral expression vector with a suitable pharmaceutical^ acceptable carrier. Introduction 
can be accomplished, for example, via direct injection of the recombinant Ad vector into the 
tumor mass. In the specific case of a cancer such as hepatocellular carcinoma (HCC), direct 
injection into the hepatic artery can be used for delivery, because most HCCs derive their 
circulation from this artery. Similar techniques of administration may be applied to other 
specific types of tumors and malignancies, as is known to those of skill in the art. 

A method of tumor-specific delivery of a tumor-suppressor gene is accomplished by 
contacting target tissue in a subject with an effective amount of a recombinant Ad-derived 
vector of this invention. In the case of anti-tumor therapy, the gene is intended to encode an 
anti-tumor agent, such as a functional tumor suppressor gene product or suicide gene product. 
The term "contacting" is intended to encompass any delivery method for the efficient transfer 
of the vector, such as via intra-tumoral injection. 
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to mother exa.pl, adenovirus veerors of the present invention can be use4 to transfer 
8 „es ,o centra, nervous sysrem <CNS) oumors h vivo. Using srereoracuc deKvery, Ad- 
Lvadveororscan-sferganasinrorheCNS^ndedfor^orrherapy. For example, 
Badi e. e, al. ttaW» 9,0-9,6 ,.994). tocorporared hymference herem, report 
tot 50* and 90% transducrion « vecror ruers of approximarely ,0' and .0> P^e-fonnang 

(pfe/nd, were observed to to vnro experiments. In toeix to Wo srudies using 
appr op„ate animal brain rumor nrodek, ft. above »' were obsenred ro have a cytoparhtc 
I! more than 50, reducrion in rumor ce„ grow* was nored a, ,0* pfn/m, no « effects 
were uored when .iters as high as ,0> 0 pfn/nd were injecred inro dre brain rissue of subjec. 
atomals (14). Thus, toe use of rirers grearer rhan 1 o' pfu/ml appear appropriare when 

challenging CNS rumors. „,„ f ,h. 

The presem invention also conunrpla.es merhods for daermnung .he efficacy of <he 
.itoto-disclosed therapeutic compositions and metoods. One such metood for 
efficacy utilizes toe human/SOD (severe combined immnnodefrcien.) mouse mode, ofEBV- 
toduced LPD aymphoprohferanve disease) ro sscenain wherher EBV-anrisense rherapeudc 
uucleoride sequences block mmor formation. (See. e.g., Pisa, er a,., BkaadJSt "3-179 
„ W Rowe, e, a,.. 0m Top Mtobisl Immunol. 1» 325 (1990)-. and Cannon, e. a).. L 
OH maa, g 5 : 1333-1337 (1990). the disclosures of which are incorporated by reference 

herein.) , 
Finally the use of Ad vectors of the present invention to prepare med.caments for the 

treatment, therapy and/or diagnosis of various diseases is also contemplated by this invenuon. 

Moreover, other anti-tumor genes may be used in combination with the correspond^ 

therapeutic agent to reduce the proliferation of tumor cells. Such other gene-and-therapeuuc- 

agent combinations are known to those of skill in the art and may be applied as taught herem. 

F . Qmaruclion nf Therannitir WM YsEtSS for Gene Delivery 

For in vitro gene transfer, administration is often accomplished by first isolating a 
selected cell population from a patient such as lung epithelial cells, lymphocytes and the like 
followed by in vitro gene transfer of the therapeutic compositions of this invention and the 
replacement of the cells into the patient. In vivo therapy is also contemplated, e.g., v.a the 
administration of therapeutic compositions of this invention by various delivery mean, For 
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example, aerosol administration and administration via subcutaneous, intravenous, 
intraperitoneal, intramuscular, ocular means and the like are also within the scope of the 
present invention. 

Other gene-delivery methods are also useful in conjunction with the methods, 
compositions and constructs of the present invention; see, e.g., published International 
Application No. WO 95/1 1984, the disclosures of which are incorporated by reference herein. 

Similarly, various non-human animals having inserted therein the vectors or 
transformed cells of this invention. These "transgenic" animals are made using methods well 
known to those of skill in the art. For example, see U.S. Patent No. 5,175,384 (the disclosures 
of which are incorporated by reference herein). 

The present invention also contemplates various methods of targeting specific cells - 
e.g., cells in a subject in need of diagnosis and/or treatment. As discussed herein, the present 
invention contemplates that the viral vectors and compositions of the present invention may be 
directed to specific receptors or cells, for the ultimate purpose of delivering those vectors and 
compositions to specific cells or cell types. The viral vectors and constructs of the present 
invention are particularly useful in this regard. 

In general, adenovirus attachment and uptake into cells are separate but cooperative 
events that result from the interaction of distinct viral coat proteins with a receptor for 
attachment and v integrin receptors for internalization. Adenovirus attachment to the cell 
surface via the fiber coat proteins has been discovered to be dissociable and distinct from the 
subsequent step of internalization, and the present invention is able to take advantage of and 
function in conjunction with these differing receptors. 
G. Other Applications 

The cell lines, viral vectors and methods of the present invention may also be used for 
purposes other than the direct administration of therapeutic nucleotide sequences. In one such 
application, the production of large quantities of biologically active proteins or polypeptides in 
cells transfected with the within-disclosed viral vectors is contemplated herein. For example, 
human lymphoblastoid cells may be transfected with an integrative viral vector of the present 
invention canying a human hematopoietic growth factor such as the gene for erythropoietin 
(EPO); cells so transfected are thus able to produce biologically active EPO. (See, e.g., Lopez 
et al., Gene 148 : 285-91 (1994).) 
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Vanous other applications and uses of the within-described methods, cell lines, 
plasm ids, vectors, and compositions of the present invention shall become apparent upon 

closer examination of the Examples that follow. 

EXAMEL1S 

The Mowing exanp.es are » • <"* « ** *« " ,V " , "° n ' 

A, such, one following desoripdon provide, details of manner in which m* 
^dimen.ofdrepresen.invendonmaybemadeend.ed. This descripdon, wh„e 
exempli of dae presen, invendon, is nor .0 he consul as *dM, Ununng rhe .nvennon. 
V^adons end e q u,valen.s, now ta own or lerer deveioped. which would he wirhin dre 
uudersranding and wchnical compete of one skilled in rhis an are ro he consadered - realm, 
within the scope of this invention. 

Example 1 

E^paratioj] " f A,Vwnvini« ParVflfinp Cell LiOSS 
Cell lines that are commonly used for growing adenovirus are useful as host cells for 

adenovirus-transformedhuman embryonic kidney cell line obtained from the ATCC, havmg 
Accession Number CRL 1573; HeU, a human epimelialca.inoma cell nne(A^^ss IO n 
Number CCL 2); A549, a human lung carcinoma cell line (ATCC Accession Number CCL 
,889V and the like epithelial-derived cell line, As a result of the adenovirus transformauon. 
the 293 cells contain the El early region regulatory gene. • All cells were maintained » 
complete DMEM + 10% fetal calf serum unless otherwise noted. 

The cell lines of this invention allow for the production and propagation of novel 
adenovirus-based gene delivery vectors hav,ng deletions in preselected gene regions by cellular 
complementation of adenoviral gene, To provide the desired complementation of such 
deieted adenovira! genomes in order to generate a novel viral vector of the present invenuon, 
plasmid vectors that contain preselected functional units were designed as described herem. 
Such units include but are not limited to El early region, ?????and the viral fiber gen. The 
preparation of plasmids providing such complementation, thereby being "complementary 
piasmids or constructs", that are stably inserted into host cell chromosomes are desenbed 



below. 
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A . Preparation of an E4-Expressin° PI as mid fo r Com plementation of E4-Gene-Deleted 
Adenoviruses 

The viral E4 regulatory region contains a single transcription unit which is alternately 
spliced to produce several different mRNAs. The E4-expressing plasmid prepared as described 
herein and used to transfect the 293 cell line contains the entire E4 transcriptional unit as 
shown in Figure 1. A DNA fragment extending from 175 nucleotides upstream of the E4 
transcriptional start site including the natural E4 promoter to 153 nucleotides downstream of 
the E4 polyadenylation signal including the natural E4 terminator signal, corresponding to 
nucleotides 32667-35780 of the adenovirus type 5 (hereinafter referred to as Ad5) genome as 
described in Chroboczek et al. (Virol, 186:280-285 (1992), GenBank Accession Number 
M73260), was amplified from Ad5 genomic DNA, obtained from the ATCC, via the 
polymerase chain reaction (PCR). Sequences of the primers used were 
5 ' CGGT AC AC AGAATTC AGG AG AC AC AACTCC3 * (forward or 5' primer referred to as 
E4L) (SEQ ID NO 1 ) and S'GCCTGGAICCGGGAAGTTACGTAACGTGGG AAAAC3' 
(SEQ ID NO 2) (backward or 3* primer referred to as E4R). To facilitate cloning of the PCR 
fragment, these oligonucleotides were designed to create novel sites for the restriction enzymes 
EcoRI and BamHI, respectively, as indicated with underlined nucleotides. DNA was amplified 
via PCR using 30 cycles of 92 C for 1 minute, 50 C for 1 minute, and 72 C for 3 minutes 
resulting in amplified full-length E4 gene products. 

The amplified DNA E4 products were then digested with EcoRI and BamHI for 
cloning into the compatible sites of pBluescript/SK+ by standard techniques to create the 
plasmid pBS/E4. A 2603 base pair (bp) cassette including the herpes simplex virus thymidine 
kinase promoter, the hygromycin resistance gene, and the thymidine kinase polyadenylation 
signal was excised from the plasmid pMEP4 (Invitrogen, San Diego, CA) by digestion with 
Fspl followed by addition of BamHI linkers (5'CGCGGATCCGCG3*) (SEQ ID NO 3) for 
subsequent digestion with BamHI to isolate the hygromycin-containing fragment. 

The isolated BamHI-modified fragment was then cloned into the BamHI site of pBS/E4 
containing the E4 region to create the plasmid pE4/Hygro containing 8710 bp (Figure 2). The 
pE4/Hygro plasmid has been deposited with the ATCC as described in Example 3. The 
complete nucleotide sequence of pE4/Hygro is listed in SEQ ID NO 4. Position number 1 of 
the linearized vector corresponds to approximately the middle portion of the pBS/SK+ 
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u ■ ««« 2 as a thin line between the 3' BamHl site in the hygromycin 
backbone as shown in Figure I as a inin 

^ 3' BcoRI sue in the E4 inse, The , and 3' ends of the E4 gene are ,ca t e at 

h ;; myci n insert are iocated at respective nuclide positions 3330 and 6470. n the cione 
±aX was selected for use, the E4 and hygromycin resistance genes were d.vergent.y 

transcribed. rTmr . Hnl . piasmid for nrniplrm rr^"" ^ Fihe.r-Gene-Deletgd 
B Pre paration of a Fiher-fcrpressing riaMiu 

AHpnnviruses 

To prepare . fiber-encoding consrruc,, primes were designed ,0 amp„fy to fiber 
eoAng mgion from Ad5 genomic DNA wim me addidon of unique BamHI and M » . 
he r Jy ends o, me fragmen,, respeouvciy. Tbe Ad 5 nuc.eo.ide sequence , avadab.e w,m 
*. GenBanU Accession Number M.8369. Tbe 5' and 3' primers bad me reapecive nuCeoude 
sequences of 5' ATGQ2AT££AAGATGAAGCGCGCAAGACCG3' (SEQ ID NO 5) and 
5 . C ATAACa^S!222CrTCrrTATrCTTGGGC3' (SEQ ID NO 6), where .be inserted 
B am HI and Nod sires axe Weed by under.in.ng. The 5' primer also conhdned a nuOeobde 
s nbs,„u,ion 3 nuCeoddes 5' of ,h. second ATG codon (C .o A, ma, is me inUiadon sire. The 
nncleoudc subsum.ion was inciuded so as ,0 improve me consensus for ini.ia.ion of fiber 
protein translation. 

The amp.if.ed DNA fragmen, was insened i„,o me BamHI and Nod sues of pcDNA 3 
nnvurogen) ,o crea,e .he p.asmid designed pCDNA3/Fiber having 7,48 bp, dm p.asmid map 
o, wh.ch is shown in figure 3. The paren, piasmid conmined .he CMV promoter, me bovine 
gro wm hormone (BHG) terminator and .be gene for conferring neomycin romance. The v.ra> 
sequence inc.uded in .bis consume, corresponds ,„ nuc.eo.idea 31040-3279. of one Ad5 

'""""me complere nucleoddc sequence of pCDN A3/Fiber is listed in SEQ ID NO 7 whom 
ft. nucleoude posidon 1 corresponds ,o approximate* ,he midd.e ofthepcDNA 3 veomr 
sequence. The 5' and 3' ends of «he fiber gene are ,ocated a, respecdve nuc.eodde rmsmons 
9 1 6 with ATG and 266 1 with T A A. 

To enhance expression of fiber pro,e,n by me consdrudve CMV promoter provded by 
t. pcDNA vecor. a Bg.H fragmen, conraining rbe .ripardm .eader (TPL) of adenovirus rype 2 
was excised from pRD. ,2a (Sbeay e, a,., .5:856-862 (.993) and inserted mm 
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the BamHI site of pCDNA3/Fiber to create the plasmid pCLF having 7469 bp, the plasmid 
map of which is shown in Figure 4. The adenovirus tripartite leader sequence, present at the 5* 
end of all major late adenoviral mRNAs as described by Logan et al., Proc, Natl, Acad. Sci., 
USA, 81 :3655-3659 (1984) and Berkner, BioTechnioues . 6:616-629 (1988), is encoded by 
three spatially separated exons corresponding to nucleotide positions 6071-6079 (the 3' end of 
the first leader segment), 7101-7172 (the entire second leader segment), and 9634-9721 (the 
third leader segment) in the adenovirus type 2 genome. The tripartite sequence, however, also 
shows correspondence with the Ad5 leader sequence having three spatially separated exons 
corresponding to nucleotide positions 6081-6089 (the 3' end of the first leader segment), 711 1- 
7182 (the entire second leader segment), and 9644-9845 (the third leader segment and 
sequence downstream Of that segment). The corresponding cDNA sequence of the tripartite 
leader sequence present in pCLF is listed in SEQ ID NO 8 bordered by BamHI/Bglll 5' and 3' 
sites at respective nucleotide positions 907-912 to 1228-1233. 

The pCLF plasmid has been deposited with the ATCC as described in Example 3. . 
The complete nucleotide sequence of pCLF is listed in SEQ ID NO 8 where the nucleotide 
position 1 corresponds to approximately the middle of the pcDNA 3 parent vector sequence. 
The 5' and 3 ends of the Ad5 fiber gene are located at respective nucleotide positions 1237- 
1239 with ATG and 2980-2982 with TAA. The rest of the vector construct has been 
previously described above. 

C. feneration of an Adenovirus Packaging Cell Line Carrying Plasmids Encoding 
Functional E4 and Fiber Proteins 

The 293 cell line was selected for preparing the first adenovirus packaging line as it 
already contains the El gene as prepared by Graham et al., J. Gen. Virol., 36:59-74 (1977) and 
as further characterized by Spector, Virol., 130:533-538 (1983). Before electroporation, 293 
cells were grown in RPMI medium + 10% fetal calf serum. Four x 10 6 cells were 
electroporated with 20 ug each of pE4/Hygro DNA and pCLF DNA using a BioRad 
GenePulser and settings of 300 V, 25 uF. DNA for electroporation was prepared using the 
Qiagen system according to the manufacturer's instructions (Bio-Rad, Richmond, CA). 

Following electroporation, cells were split into fresh complete DMEM + 10% fetal calf 
serum containing 200 ug/ml Hygromycin B (Sigma, St. Louis, MO). 
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Fmm expanded — . — « — "* *< "MICROTURBOGEN" 
DNA was assessed by PCR using the primer pair E4R and 0RF6L 

{5 K3CTTAAGCGGCCGCGAAGGAGAAGTCC3') (SEQ ID NO 9), me lane, of wh,ch . a 
5 forward prunes neas adenovirus 5 open reading frame 6. Refer roHgnre , for posiuon of 

thP nrimers relative to the E4 genes. 

' One eione. designared - selecre. exhibiting .rered grow* properties relanve ,0 
era, seen in paren, eeU line 293. The 2,! e.one conrained rhe eapeered produc, ind,canng rhe 
presen ceofinsertedDNA cording romost, if nor all. ofrhe E4 fragmen, oonrarned m 
^grop^d.The.nealllinehasheendepo^-fteATCCaa^m 

Example 3 This line was further evaluared by runplifreaUon using Ore pnmer parr E4UE4R 

Genonuc Sourhem bloning was penonned on DNA resrricmd wrrh EooRI andBamH. The 
I ,mgmen, was .hen dereored a, approximarely one copy/genome compared ,0 sundards w,dr 
„ EcoRI/BamHI E4 (ragmen, as oloned inro pBS/E4 for use as a Ubeled probe wdh dre 
Genius sysrem aceording ,o manufacrurers insrructrons (Boehringer Mannheim. Indranapohs. 
M ,„ DN A from me 2 . 1 cell line, me expecred labeled inremal fragment pE4/Hygro 
hybridized wirh me isolared E4 sequences, m addnion. .he probe hybridized ,0 a larger 
fra-men. which may be rhe reach of a second insertion e»en. (Figure 5). 

Ahhough me 211 cell line was no. selecred by neomycin resistance, thus intaong me 
absence of frber gene, ,o confirm me lack of frher gene. rhe2>.ce.. line was analyzed for 
expressron of fiber prorein by iudireo. .mmunofiuorescence with an ruui-frber polyclonal 
anubodyandaFfTClabeled anri-rabbir IgG (KPL, as secondary. No immunoreactivny was 
de.ec.ed. Therefore. .0 generare 2, 1 clones training recomhinan, fiber genes, .he 21 1 clone 
was expanded by growing in RPMI medium and subjeored ro addirional elerarroporarion wrm 
rhe fiber-encoding pCU plasmid as described above. 

Following elecroporarion. cells were plared inDMEM + 10% feud calf serum and 
monies were selecred wirh 200 pg/ml 04,8 (Giboo, Gaithersburg. MD). Posirive car, Imes 
remained hygromyein resist. These Candida, sublines of 2. 1 we„ men screened for fiber 
prorem expression by indireer .mmunofiuorescence as described above. The three sublmes 
screened 21 1 A. 2UB and 2UR. along wim a number of other sublines, all exhibited nuclear 
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staining qualitatively comparable to the positive control of 293 cells infected with AdRSV gal 
(1 pfu/cell) and stained 24 hours post-infection. 

Lines positive for nuclear staining in this assay were then subjected to Western blot 
analysis under denaturing conditions using the same antibody. Several lines in which the 
antibody detected a protein of the expected molecular weight (62 kd for the Ad5 fiber protein) 
were selected for further study including 21 1A, 21 IB and 21 1R. The 21 1 A cell line has been 
deposited with ATCC as described in Example 3. 

Western blot analysis using soluble nuclear extracts from these three cell lines and a 
seminative electrophoresis system demonstrated that the fiber protein expressed is in the 
functional trimeric form characteristic of the native fiber protein as shown in Figure 6. The 
predicted molecular weight of a trimerized fiber is 186 kd. The lane marked 293 lacks fiber 
while the sublines contain detectable fiber. Under denaturing conditions, the trimeric form was 
destroyed resulting in detectable fiber monomers as shown in Figure 6. Those clones 
containing endogenous El, newly expressed recombinant E4 and fiber proteins were selected 
for use in complementing adenovirus gene delivery vectors having the corresponding 
adenoviral genes deleted as described in Example 2. 

D. Preparation of an FJ -Expressing Plasmid fo r Complementation of El -Gene-Deleted 
Adenoviruses 

In order to prepare adenoviral packaging cell lines other than those based on the El - 
gene containing 293 cell line as described in Example 1C above, plasmid vectors containing El 
alone or in various combinations with E4 and fiber genes are constructed as described below. 

The region of the adenovirus genome containing the Ela and Elb gene is amplified 
from viral genomic DNA by PCR as previously described. The primers used are E1L, the 5' or 
forward primer, and El R, the 3' or backward primer, having the respective nucleotide 
sequences 5TCGAGCTAGCGACTGAAAATGAG3* (SEQ ID NO 10) and 
5'CCTCTCGAGAGACAGCAAGACAC3* (SEQ ID NO 11). The E1L and E1R primers 
include the respective restriction sites Nhel and Xhol as indicated by the underlines. The sites 
are used to clone the amplified E 1 gene fragment into the Nhel/Xhol sites in pMAM 
commercially available from Clontech (Palo Alto, CA) to form the plasmid pDEX/El having 
1 1 152 bp, the plasmid map of which is shown in Figure 7. 
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„f „r.T-V/E1 is listed in SEQ ID NO 12 where the 
The complete nucleotide sequence of pDEX/El is Ustea ma* 

^ ,n proximately 1454 nucleotides from the 3' end of the 
nucleotide position 1 corresponds to approximately 
MAM balne vecior sequence. The pDEX/E , plasmid includes » 
^lovitna genome posted do— M-* - — * P— ^ 
, at 4998 in the pDEX/El plasmid, of me gtaocorbeoid-mducib* mouse mammary 

to , „ows stable — . - b. isolated using ^^^^T 
action. The pMAM backbone occupies nucleotide positions ,-,454 and 5005-1 1 .52 of 

Fanaigpaia and Fiber Ttpwins 

To create separate adenovirus packaging eel, Hnes equivalent to tbat of the 2,1 

adenoviral genomes are se,ected for tranafection win, the plaatnid construe* a, desenbed 
lav Jept^lebos,ce,,sinc,nde A 54,He,a,Vero a ndn.l, t ece,,,nes^,n 

E*amp,e , The se,ec,ed eel, line is tmnsfected with the separare plasmtds. pDEX/E, end 
pCLF respecttvely for expressing El. and fiber complementary proteins. Followmg 

pto mids are isolared by selection with neomycin and HAT. Integration o, full-length copy of 
th e El gene is assessed by PCR amplification from genomic DMA using the pnmer M 
ElL/ElR.as desenbed above. Functional tnsertton of the fiber gene is assayed by stamtng 
with the anti-fiber antibody as previously described. 

The resultant stably integrated eel, „ne is then used as a packaging eel, system to 
complement adenoviral gene delivery vectors having the cotresponding ndenovtra, gene 
deletions as described in Example 2. 
f. teEaariaroijt^^ 

"-nrl-niTT "ene-rtrHrd Adenovjtusa 

The methods described in the pteceding Examples rely on the use of two plasmtds, 
pE4/Hyg-o and pCLF. or, pCLF and pDEXEl for generating adenoviral oe,l pnokagtng 

» ..emative embodiments contemplated for use with rhe methods of this tnvenrton, 
clplementingplasmidscon-ng two or more adenovh, genes for expressing of encoded 



WO 98/13499 



-47- 



PCT/EP97/05251 



proteins in various combinations are also prepared as described below. The resultant plasmids 
are then used in various cell systems with delivery plasmids having the corresponding 
adenoviral gene deletions. The selection of packaging cell, content of the delivery plasmids 
and content of the complementing plasmids for use in generating recombinant adenovirus viral 
vectors of this invention thus depends on whether other adenoviral genes are deleted along 
with the adenoviral fiber gene, and, if so, which ones. 

j. Pre paration of a Complementing Plasmid C ontaining Fib er and El Adenoviral 

Genes 

A DNA fragment containing sequences for the CMV promoter, adenovirus tripartite 
leader, fiber gene and bovine growth hormone terminator is amplified from pCLF prepared in 
Example IB using the forward primer 5'GACGGATCGGGAGATCTCC3' (SEQ ID NO 13), 
that anneals to the nucleotides 1-19 of the pCDNA3 vector backbone in pCLF, and the 
backward primer 5'CCGCCTCAGAAGCCATAGAGCC3 , (SEQ ID NO. 14) that anneals to 
nucleotides 1278-1257 of the pCDNA3 vector backbone. The fragment is amplified as 
previously described and then cloned into the pDEX/El plasmid, prepared in Example ID. 
For cloning in the DNA fragment, the pDEX/El vector is first digested with Ndel, that cuts at 
a unique site in the pMAM vector backbone in pDEX/El, then the ends are repaired by 
treatment with bacteriophage T4 polymerase and dNTPs. 

The resulting plasmid containing El and fiber genes, designated pEl/Fiber, provides 
both dexamethasone-inducible El function as described for DEX/E1 and expression of Ad5 
fiber protein as described above. A schematic plasmid map of pEl/Fiber, having 14455 bp* is 
shown in Figure 8. 

The complete nucleotide sequence of pEl/Fiber is listed in SEQ ID NO 15 where the 
nucleotide position 1 corresponds to approximately to 1459 nucleotides from the 3' end of the 
parent vector pMAM sequence. The 5' and 3 ends of the Ad5 El gene are located at 
respective nucleotide positions 1460 and 4998 followed by pMAM backbone and then 
separated from the Ad5 fiber from pCLF by the filled-in blunt ended Ndel site. The 5' and 3' 
ends of the pCLF fiber gene fragment are located at respective nucleotide positions 10922- 
14223 containing elements as previously described for pCLF. 

The resultant pEl /Fiber plasmid is then used to complement one or more delivery 
plasmids expressing El and fiber. 
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The pE ,/Fiber consult Is men used >o uansfec, . «ko< hos, ceil as described . 
E^ple IE .0 gene,a« suble cnromosomai insen,o„s prefomned , previously descnted 
Mowed by seiecuon on HAT medium. Tbe subie ce„s m men used as packaging ceils as 
described in Example 2. 

pCLF p^Ted as described in Example IB is partially digested with Bgin to cut only 
at the sue in the P CDNA3 backbone. The pE4/Hygro plasmid prepared in Example 1 A » 
dlgest ed with BamHI to produce a fragment containing E4. The E4 fragment is then insened 
into the BamHI she of pCLF to form plasmid P E4/Fiber. The resultant plasmid proves 
expression of the fiber gene as described for pCLF and E4 function as described for 

pE4/Hygro. n — 

A schematic plasmid map of P E4/Fiber, having 10610 bp, is shown m Ftgure 9. The 
compl ete nuclide sequence of P E4/Fiber is listed in SEQ ID NO 16 whe« the nucleotide 
position 1 corresponds to approximately 14 bp from the 3' end of the parent vector P CDNA3 
backbone sequence. The 5' and 3 ends of the Ad5 E4 gene are located at respective 
positions 2 1 and 3149 followed by fused Bgin/BamHI sites and pCDNA3 backbone includmg 
the CMV promoter again followed by Bgin/BamHI sites. The adenovirus leader sequence 
be *ins at nucleotide position 4051 and extends to 4366 followed by fused BamHI/Bgin sites 
and the 5' and 3' ends of the fiber gene located at respective nucleotide positions 4372 and 

Stable chromosonal insertions of P E4/Fiber in host cells are obtained as described 



6124. 
above. 
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Example 2 

Preparation of Adenoviral Gene Del ivery Vectors 
Ikin g Adenoviral Packaging Cell Lines 
Adenoviral delivery vectors of this invention are prepared to separately lack the 
combinations of El/fiber and E4/fiber. Such vectors are more replication-defective than those 
previously in use due to the absence of multiple viral genes. A preferred adenoviral delivery 
vector of this invention that is replication competent but only via a non-fiber means is one that 
only lacks the fiber gene but contains the remaining functional adenoviral regulatory and 
structural genes. Furthermore, the adenovirus delivery vectors of this invention have a higher 
capacity for insertion of foreign DNA. 

A. Preparation of Adenoviral Gene Delivery Vectors Hav ing Specific G ene Deletions and 
Methods of Use 

To construct the El/ /fiber deleted viral vector containing the LacZ reporter gene 
construct, two new plasmids were constructed. The plasmid pA ElBp gal was constructed as 
follows. By digestion of pSVp gal (ProMega Corp., Madison, WI) with Vspl, a DNA 
fragment containing the SV40 regulatory sequences and the E. coli -P-galactosidase gene was 
isolated. The resulting fragment having overhanging ends was then filled in with Klenow 
fragment of DNA polymerase 1 in the presence of dNTPs followed by digestion with BamHI. 
The resulting fragment was cloned into the EcoRV and BamHI sites in the polylinker of pA 
ElsplB (Microbix Biosystems, Hamilton, Ontario) to form p A E1B gal that therefore 
contained the left end of the adenovirus genome with the Ela region replaced by the LacZ 
cassette (nucleotides 6690 to 4151) of pSVp gal. Plasmid DNA was prepared by the alkaline 
lysis method as described by Bimboim and Doly, Nuc. Acids Res.. 7:1513-1523 (1978) from 
transformed cells used to expand the plasmid. DNA was then purified by CsCl-ethidium 
bromide density gradient centrifugation. 

The second plasmid (pDV44), prepared as described herein, is derived from pBHGlO, 
a vector prepared a described by Ben et al., Proc. Natl. A cad. ScL USA. 91 :8802-8806 
(1994) and commercially available from Microbix, which contains an Ad5 genome with the 
packaging signals at the left end deleted and the E3 region (nucleotides 28133:30818) replaced 
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schematic of the plasmid map is shown m Figure 13. The pll J pias 
wilh Pacl and Sail - remove dre fiber. E4, and inverred terminal repea, (ITR, sequencea 
This fragmen, is replaced wtdr a fragmen, containing ure ITR segmenrs and rhe E4 
2 e„ewhichis generated by PCR ampfficanon from pBHG.O using rhefollcwrng 

W and Ban*., respectively. Coning Uris fragmen. inro me Pacl and blunr ended SaH 
ste of me p, 1 .3 backbone resulred in a snbsrimrion of me fused ITRs. E4 regron and fiber 
gene present in pBHOlO , by the ITRs and E4 region alone. 

!„ general, rhe method for vims production by ..combination of plasmids foUowed b, 
complementation in eel, enhure involves the isolation of recombinant vinrses by —don 
of any one of rhe adenovinta packaging ce„ sysrems prepared in Example 1 , namely 21. A, 
2, IB. 21 1R. A549. Vero cells, and the like, with plasmids casing sequences cotmapondmg 

to viral gene delivery vectors. 

A selected cell line is plared in dishes and couansfeoted wirh pDV44 and p E1B gal 
using the calcium phosphate method as desoribed by Ben e, a... taUM^U™. 
9, 8802.8806 (1994). Recombinarion between the overlapping adenovirus sequences tn .he 
w0 plasnuds leads ,0 me creation of a f 1,-lengm viral chromosome where P DV44 and pi 
E ,Bp ga. recombine ,0 fonn a recombinant adenovims vector having mulnple ddetion, Tne 
deletion of El and of the fiber gene from the viral chromosome is compensated for by dre 
sequences integrared into the packaging cell genome, and infectious virus parttclea are 
produced. The plaques .bus generated am isolared and stocks of dte recombinan, vims t»e 
produced by standard methods. 

m a preferred embodiment of this invention, a deliver, plasmid is prepared .bar does 
no, require the above-described recombination even. .0 prepare a therapeuric viral vector 
having a fiber gene deletion. A single ddivery piasmid containing =11 -he adeoovira! genome 
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necessary for packaging but lacking the fiber gene is prepared from plasmid pFG140 
containing full-length Ad85 that is commercially available from Microbix. The resultant 
delivery plasmid referred to as pFG140-f is then used with pCLF stably integrated cells as 
described above to prepare a therapeutic viral vector lacking fiber. In a preferred aspect of this 
invention, the fiber gene is replace with a therapeutic gene of interest for preparing a 
therapeutic delivery adenoviral vector. 

Vectors for the delivery of any desired therapeutic gene are prepared by cloning the 
gene of interest into the multiple cloning sites in the poly linker of commercially available p 
ElsplB (Microbix Biosystems), in an analogous manner as performed for preparing p E1B gal 
as described above. The same cotransfection and recombination procedure is then followed as 
described herein to obtain viral gene delivery vectors. 

The recombinant viruses thus produced are used as gene delivery tools both in cultured 
cells and in vivo . For studies of the effectiveness and relative immunogenicity of multiply- 
deleted vectors, virus particles are produced by growth in the packaging lines described in 
Example 1 and are purified by CsCl gradient centrifugation. Following titering, virus particles 
are administered to mice via systemic or local injection or by aerosol delivery to lung. The 
LacZ reporter gene allows the number and type of cells which are successfully transduced to 
be evaluated. The duration of transgene expression is evaluated in order to determine the 
long-term effectiveness of treatment with multiply-deleted recombinant adenoviruses relative 
to the standard technologies which have been used in clinical trials to date. The immune 
response to the improved vectors described here is determined by assessing parameters such as 
inflammation, production of cytotoxic T lymphocytes directed against the vector, and the 
nature and magnitude of the antibody response directed against viral proteins. 

Versions of the vectors which contain therapeutic genes such as CFTR for treatment of 
cystic fibrosis or tumor suppressor genes for cancer treatment are evaluated in the animal 
system for safety and efficiency of gene transfer and expression. Following this evaluation, 
they are used as experimental therapeutic agents in human clinical trials. 
B, Retargeting of Adenoviral Gene Delivery Vectors b v Producing Viral Particles 

romaining Different or Altered Fiber Proteins 

As the specificity of adenovirus binding to target cells is largely determined by the fiber 
protein, viral particles that incorporate modified fiber proteins or fiber proteins from different 
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, a v.rtnr^ have different specificities. Thus, the expression 
^pnnviral serotypes (pseudotyped vectors) nave Qinci v 

adenovmuserotyp oackasi ng cells as described above is also 

of the native Ad5 fiber protein in adenovirus packag.n c 

i;~Me to nroducuon of different fiber proteins. 
^ ! 1 Ipec, of — chimeric fiber proteins are 

o e, al 1 Virol 69:2850-2857 (1995). The authors showed the, the 

melho ds of Stevenson e, ... » ^ ^ rf fc ^ ^ 

that isolated exchanged in order to produce 

— — 'ires-. 

packaging eells, the construes described herein are nsed ro rransfec, cel.s along 
DNA J 11^. T^* - A- — - seances are available * * «P»- 
ATG and ending with the termination codon TAA. for ciora 

^U*. — sltesBan.nn^Noaforclo.nginropcDNAp^ns 
described in Example 1 A. PCR is performed as described above. 

The resulun, prodocrs are .hen nsed ,0 — chimeric fibe— *PCR 
tm overlap extension, as described by Horron «, a,.. BiaM^. 8:525-535 <1990 . The 
Ad5 fiber mi, and shaft regions (5T8; fine nncieoride region encoding antino aetd restdue 

needing ammo acid residue posirions ,36 ,0 3,9, to form the 5TS3H fiber ehtmem. 

acid residues posirtons , ,0 ,35) are connected ,0 dte Ad5 fiber « <» * 
nucleotide regton encoding the nnuno acid residue posirions 404 ,0 581, to fonr, the 3TS5H 
L cntmern The fusions are ntade a, the conserved 3XWT <SEQ 133 NO 2, ) seance a, One 
fiber shaft-head junction. 
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The resultant chimeric fiber PCR products are then digested with BamHI and NotI for 
separate directional ligation into a similarly digested pcDNA vector. The Ad2 leader sequence 
is then subcloned into the BamHI as described in Example 1 A for preparing an expression 
vector for subsequent transfection into 21 1 cells as described above or into the alternative 
packaging cell systems as previously described. The resultant chimeric fiber construct- 
containing adenoviral packaging cell lines are then used to complement adenoviral delivery 
vectors as previously described. Other fiber chimeric constructs are obtained using a similar 
approach with the various adenovirus serotypes known. 

In an alternative embodiment, the methods of this invention contemplate the use of the 
modified proteins including novel epitopes as described by Michael et al M Gene Therapy, 
2:660-668 (1995) and in International Publication WO 95/26412, the disclosures of which are 
incorporated by reference herein. Both publications describe the construction of a cell-type 
specific therapeutic viral vector having a new binding specificity incorporated into the virus 
concurrent with the destruction of the endogenous viral binding specificity. In particular, the 
authors described the production of an adenoviral vector encoding a gastrin releasing peptide 
(GRP) at the 3* end of the coding sequence of the Ad5 fiber gene. The resulting fiber-GRP 
fusion protein was expressed and shown to assemble functional fiber trimers that were 
correctly transported to the nucleus of HeLa cells following synthesis. 

Based on the teachings in the paper and International Publication, similar constructs are 
contemplated for use in the complementing adenoviral packaging cell systems of this invention 
for generating new adenoviral gene delivery vectors that are replication-deficient and less 
immunogenic. Heterologous ligands contemplated for use herein to redirect fiber specificity 
range from as few as 10 amino acids in size to large globular structures, some of which 
necessitate the addition of a spacer region so as to reduce or preclude steric hindrance of the 
heterologous ligand with the fiber or prevent trimerization of the fiber protein. The ligands are 
inserted at the end or within the linker region. Preferred ligands include those that target 
specific cell receptors or those that are used for coupling to other moieties such as biotin and 
avidin. The types of cell signaling as a result of binding by a ligand is dependent upon the 
specificity of that ligand; i.e, receptor internalization or lack thereof. 

A preferred spacer includes a short 12 amino acid peptide linker composed of a series 
of serines and alanine flanked by a proline residue at each end. One of ordinary skill in the art 
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„ fibers havrng posted inrerually widnin dre fiber peoreta, « «he anuno 

Id- a purred iirdre, sequence of ProScr^rAUSarMaSerA.aProG.ySer ( SEQ D> 
Cowed by a unique restriction * - « «, codons, ro — «• 

;„Lon of a coding sequence for a se,ec.ed heteroiogous ligand and .0 ensure propor 
1 don — . T*e * end of dre anbsense oHgonuCeodoa ^ 
Hp with vacror sequence in.o which ,ha oHgonucfaodda is insened » stte-drrected 
I! Is, Poiiowingtnuragenesrsof ,ha p^P ^uanca adding , a h„ k ar and s,„p codon 
sequences a nuclcouda sequence encoding a preselected iigand is obtarned, hrta* 

lineariaed corresponding restriction site. 

Into the resulrant pCLF vecror conraining a Ad5 fiber gene sequence wirh 3 
nuclides encoding a .inker and a iigand. dre Ad2 iaader sequence is inserted as prevtousiy 
led The resuitan, fiber-iigand consume, is rhen used ,0 rransfec, 2, , or the ahernanve 

systemsforusewidrrhemedrodaofdrisinvention. 

h a further entboduuent, fiber proreins encoded by fiber genes rsoiared from dtfferont 
***** serotypes a. used rnrac, fo, ,,o 2, , or an ahemarive cefi pacing 

system as previously described. 

A gene encoding the fiber protein of inures, is first Coned ,0 creute a p,a*md 
ana ,„g„us ro pCLP, and atabia ce» hues producing the fioor prorein „ generated as ^escnfcd 
above for Ad5 fiber. The adenovirus vector described which iaCcs dre fiber gene ^«n 
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only fiber gene present is the one in the packaging cells, the adenoviruses produced contain 
only the fiber protein of interest and therefore have the binding specificity conferred by the 
complementing protein. Such viral particles are used in studies such as those described above 
to determine their properties in experimental animal systems. 
q Tar geted Gene Delivery Using Viral Vector Parti cles Lacking Fiber Protein 

An alternative mode of entry for adenoviral infection of hematopoietic cells has been 
described by Huang, et al., J. Virol . 69:2257-2263 (1995) which does not involve the fiber 
protein-host cell receptor interaction. As infection of most other cell types does require the 
presence of fiber protein, vector panicles which lack fiber may preferentially infect 
hematopoietic cells, such as monocytes or macrophages. 

To produce a fiber-free adenovirus vector particle, a vector lacking the fiber gene as 
described above in Example 2 A but containing a gene of interest for delivery is amplified by 
growth in cells which do not produce a fiber protein, such as the 21 1 cells prepared in Example 
1, thereby producing large numbers of particles lacking fiber protein. The recovered fiber-free 
viral particles are then used to deliver the inserted gene of interest following the methods of 
this invention via targeting mechanisms provided by other regions of the adenoviral vector, i.e., 
via the native penton base. 

Example 3 

Deposit of Materials 

The following cell lines and plasmids have been deposited on September 25, 1996, with 
the American Type Culture Collection, 1301 Parklawn Drive, Rockville, MD, USA (ATCC): 

Material ; ATCC Accession No. 

Plasmid pE4/Hygro 97739 
Plasmid pCLF 97737 
211 Cell Line CRL-12193 
21 1A Cell Line CRL-12194 



The foregoing written specification is considered to be sufficient to enable one skilled 
in the art to practice the invention. The present invention is not to be limited in scope by the 
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' Example 4 

i u„™ 1 1 et al 1994 Characterization of the knob 
The native fiber protein is a homotnmer | Henry LJ. et al 
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fiber { Wickham T et al 1993 Cell 73:309-319} (the 58 kD Ad2 and 62 kD Ad5 fibers have very 
similar mobilities under these conditions). 

To determine whether the fiber-expressing lines could support the growth of a fiber-defective 
adenovirus, we performed one-step growth experiments using the temperature-sensitive fiber mutant Ad 
H5/5142 (the gift of Harold Ginsberg). At the restrictive temperature (39.5 °C), this mutant produces 
an underglycoslyaied fiber protein which is not incorporated into mature virions { Chee-Sheung C. C et 
al 1982 J. Virol 42: 932-950 }. This results in the accumulation of non-infectious viral particles. We 
asked whether the recombinant fiber protein expressed by our cell lines could complement the H5fjl42 

defect and rescue viral growth. 

CelHines 293,21 1A,21 IB and211R (2 x 10 6 cells/sample) were infected with H5r5l42 at 10 
pfu/cell. 48 hours later, cells were detached with 25 mM EDTA and virus was harvested by four rapid 
freeze-thaw cycles. Debris was removed by a 10 minute spin at 1500 x g, and viral titers determined by 
fluorescent focus assay { Thiel J.F et al 1967 Proc. Soc. Exp. Biol. Med. 125:892-895 } on SW480 
cells with a polyclonal anti-penton base Ab { Wickham T et al 1993 Cell 73:309-319). As shown in 
Fig. 14, the fiber mutant virus replicated to high titers in 293 cells at 32.5° C (the permissive 
temperature), but to a much lower extent at the restrictive temperature of 39.5° C. The fiber-producing 
packaging lines 21 1 A, 21 IB, or 21 1R supported virus production at 39.5* C to levels within two- to 
three-fold of those seen at the permissive temperature in 293 cells, indicating that these cells provided 
partial complementation of the fiber defect. 

Interestingly, virus yields from the fiber-producing cell lines were also somewhat higher than 
those from 293 cells at 32.5° C (the 'permissive' temperature). This suggests that fiber produced by the 
tsl42 virus may be partially defective even at the permissive temperature. Alternatively, a non- 
specific increase in adenoviral titer could result when viruses are grown in our packaging cells, by a 
mechanism not involving fiber complementation. However, we have found that viruses with wild type 
fiber genes (such as Ad.RSVbgal) replicate to identical levels either in our packaging lines or in 293 
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demonstrates that the type 5 fiber protein produced by the cells is capable of assembling into complete 
Ad3 particles. 

A vector based on Ad5 but containing the gene for the Ad7 fiber protein has been described { 
Gall J. et al 1996 J. Virol. 70:21 16-2123}, as well as Ads containing chimeric fiber genes {Krasnykh 
V. N et al J. Virol. 70:6839-6846}. Addition of a short peptide linker to the fiber in order to confer 
binding to a different cellular protein has also been reported { 8188 }. By using packaging technology 
such as that presented here, Ad vectors equipped with different fiber proteins may be produced simply 
by growth in cells expressing the fiber of interest, without the time-consuming step of generating a new 
vector genome for each application. 

Replacing or modifying the fiber gene in the vector chromosome would also require that the new 
fiber protein bind a receptor on the surface of the cells it which it is to be grown. The packaging cell 
approach will allow the generation of Ad particles containing a fiber which can no longer bind to its 
host cells, by a single round of growth in cells expressing the desired fiber gene. This will greatly 
expand the repertoire of fiber proteins which can be incorporated into particles, as well as simplifying 
the process of retargeting gene delivery vectors. 

Finally, a novel fiber-independent pathway of infection has recently been described in 
hematopoietic cells, in which penton base provides the initial virus-cell interaction by binding to integrin 
0^2 I Huang S. et al 1996 J. Virol 70: 4502-4508). This suggests that viral particles lacking fiber 
protein may be useful in targeting gene delivery to specific cell types via this pathway. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Nemerow, Glen R- 

Von Seggern, Daniel J. 

TITLE OF INVENTION : PACKAGING CELL LINES , ADENOVIRUS 
VECTORS , AND METHODS OF USING SAME 

(iii) NUMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS: 

,A) ADDRESSEE: THE SCRIPPS RESEARCH INSTITUTE 

( B) STREET: 10550 North Torrey Pines Road 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: United States 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE : Patentln Release #1.0, Version #1-25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 25-SEP-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Logan, April C. 

(B) REGISTRATION NUMBER: 33,950 

(C) REFERENCE/DOCKET NUMBER: TSRI 554.0 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE : (619) 554-2937 
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(B) TELEFAX: (619) 554-6312 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CGGTACACAG AATTCAGGAG ACACAACTCC 
30 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: DNA (genomic) 



(iii) 



HYPOTHETICAL: NO 



(iv) 



ANTI-SENSE : NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 

GCCTGGATCC GGGAAGTTAC GTAACGTGGG AAAAC 
35 

(2 ) INFORMATION FOR SEQ ID NO 

(i , SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CGCGGATCCG CG 
12 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CACCTAAATT GTAAGCGTTA ATATTTTGTT AAAATTCGCG TTAAATTTTT GTTAAATCAG 
60 

CTCATTTTTT AACCAATAGG CCGAAATCGG CAAAATCCCT TATAAATCAA AAGAATAGAC 
120' 

CGAGATAGGG TTGAGTGTTG TTCCAGTTTG GAACAAGAGT CCACTATTAA AGAACGTGGA 
180 

CTCCAACGTC AAAGGGCGAA AAACCGTCTA TCAGGGCGAT GGCCCACTAC GTGAACCATC 
240. 

ACCCTAATCA AGTTTTTTGG GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG 
300 

GAGCCCCCGA TTTAGAGCTT GACGGGGAAA GCCGGCGAAC GTGGCGAGAA AGGAAGGGAA 
, 360 

GAAAGCGAAA GGAGCGGGCG CTAGGGCGCT GGCAAGTGTA GCGGTCACGC TGCGCGTAAC 
420 

CACCACACCC GCCGCGCTTA ATGCGCCGCT ACAGGGCGCG TCCCATTCGC CATTCAGGCT 
480 

GCGCAACTGT TGGGAAGGGC GATCGGTGCG GGCCTCTTCG CTATTACGCC AGCTGGCGAA 
540 

AGGGGGATGT GCTGCAAGGC GATTAAGTTG GGTAACGCCA GGGTTTTCCC AGTCACGACG 
600 

TTGTAAAACG ACGGCCAGTG AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
660 

GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC GAATTCAGGA GACACAACTC 
720 
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^rrr ACTGGTCTGG CCACAACTAC ATTAATGAAA 
CAAGTGCATA CTCTATGTCA TTTTCATGGG ACTGG.CT 

780 

TATTTGCCAC A^AC — AGA.AAAGA — 

840 

TTATGTTTCA ACGTGTTTAT TTTTCAATTG 
900 

AGATCACCGT ACCTTAATCA AACTCACAGA 



AGTATAGCCC CACCACCACA TAGCTTATAC 

ACCCTAGTAT T CAACC T GCC ACC.CCC.CC CAACACACAG AGTACACAGT CC TTT C T CCC 
1020 

^GCAT CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC 
CGGCTGGCCT TAAAAAGCAT CAlAiuii 

1080 

r r^ACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC 
CACACGGTTT CCTGTCGAGC CAAACGCTCA TCAo 

1140 

TCACTTAAGT TCATGT^ -AGCCACAG GCTGCTGTCC AACTTGCGGT 

1200 

„GG CCGGCGAAGG AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC 
1260 

r.r-,nr rrrrGAATAA ACTGCTGCCG CCGCCGCTCC 
ATCAGGATAG GGCGGTGGTG CTGCAGCAGC GCGCGAATAA A 

1320 

GTCCTGCAGG AATACAACAT GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCSGAGC 

aIggcgcc ttg™ GGCACAGCAG CGCACGCTGA tctcacttaa ATCAGCAGAG 
1440 

TAACTGCAGC ACAGCACCAC AA^G AAAAXCCCAC AGTGCAAGGC GCTGTATCCA 
1500 



WO 98/13499 



-65- 



PCT/EP97/05251 



AAGCTCATGG CGGGGACCAC AGAAC CCACG . TGGCC ATCAT ACCACAAGCG CAGGTAGATT 
1560 

AAGTGGCGAC CCCTCATAAA CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA 
1620 

TTCACCACCT CCCGGTACCA TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC 
1680 

CTAAACCAGC TGGCCAAAAC CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA 
1740 

CAATGACAGT GGAGAGCCCA GGACTCGTAA CCATGGATCA TCATGCTCGT CATGATATCA 
1800 

ATGTTGGCAC AACACAGGCA CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC 
1860 

GTTAGAACCA TATCCCAGGG AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG 
1920 

GGAAGACCTC GCACGTAACT CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC 
1980 

GGATGATCCT CCAGTATGGT AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA 
2040 

CTGTACGGAG TGCGCCGAGA CAACCGAGAT CGTGTTGGTC . GTAGTGTCAT GCCAAATGGA 
2100 

ACGCCGGACG TAGTCATATT TCCTGAAGCA AAACCAGGTG CGGGCGTGAC AAACAGATCT 
2160 

GCGTCTCCGG TCTCGCCGCT TAGATCGCTC TGTGTAGTAG TTGTAGTATA TCCACTCTCT 
2220 



CAAAGCATCC AGGCGCCCCC TGGCTTCGGG TTCTATGTAA ACTCCTTCAT GCGCCGCTGC 
2280 
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, CCG CACACCCAGC CAACCTACAC ATTCGTTCTG 

CCTGATAACA TCCACCACCG 



TCGAGGAG ^— TGGAAGAACC ATGTTTTTTT TTTTATTCCA 
CGAGTCACAC ACGGGAGGAG 



CAGAATAAGC 

ATAAL.A. 

2340 

CGGGAAGAGC 

TCACJAu /v-»~ 

2400 

AAAGATTATC CAAAAGCCA " — " 
2460 

c „ -acagcg aaagaacaga — - — - ™ G 

2520 

„G GCAAACGGGG C« AGTGGACGTA AAGGCTAAAC CCTTCAGGGT 

lL OT C TATAAACATT CCAGCACCTT CAACCATGCC CAAATAATTC TCATCTCGCC 
2640 

SCCTTCTCAi tatatctcta agcaaatccc gaa^aag .ccggggax, g T aaaaa OT 

2700 

^cagagc gcgg T ccagc TT caggg,ga agcagcgaa, ca^gca aaaa^cagg 

"icAGAG ACC T G» T AA GA^AAAAG CGGAACA^A ACAAAAA.AC GGCGA.CGGG 
2820 

r-rrr* GCTGAACATA ATCGTGCAGG TCTGCACGGA CCAGCGCGGC 
TAGGTCCCTT CGCAGGGCCA GCTGAACATA a 

2880 

^^irirrc, ATTATGACAC GCATACTCGG 
CACTTCCCCG CCAGGAACCT TGACAAAAGA ACCCACACTG 

2940 

.rCAGCGTAG CCCCGATGTA AGCTTTGTTG CATGGGCGGC GATATAAAAT 
AGCTATGCTA ACCAGCGTAG CU-^ 

3000 

GCTCAAAAAA TCAGGCAAAG CC.CGCGGAA AAAAGAAAGG ACATCGTAGT 

3060 
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CATGCTCATG CAGATAAAGG CAGGTAAGCT CCGGAACCAC CACAGAAAAA GACACCATTT 
3120 

TTCTCTCAAA CATGTCTGCG GGTTTCTGCA TAAACACAAA ATAAAATAAC AAAAAAACAT 
3180 

TTAAACATTA GAAGCCTGTC TTACAACAGG AAAAACAACC CTTATAAGCA TAAGACGGAC 
3240 

TACGGCCATG CCGGCGTGAC CGTAAAAAAA CTGGTCACCG TGATTAAAAA GCACCACCGA 
3300 

CAGCTCCTCG GTCATGTCCG GAGTCATAAT GTAAGACTCG GTAAACACAT CAGGTTGATT 
3360 

CATCGGTCAG TGCTAAAAAG CGACCGAAAT AGCCCGGGGG AATACATACC CGCAGGCGTA 
3420 

GAGACAACAT TACAGCCCCC ATAGGAGGTA TAACAAAATT AATAGGAGAG AAAAACACAT 
3480 

AAACACCTGA AAAACCCTCC TGCCTAGGCA AAATAGCACC CTCCCGCTCC AGAACAACAT 
3540 

ACAGCGCTTC ACAGCGGCAG CCTAACAGTC AGCCTTACCA GTAAAAAAGA AAACCTATTA 
3600 

AAAAAACACC ACTCGACACG GCACCAGCTC AATCAGTCAC AGTGTAAAAA AGGGCCAAGT 
3660 

GCAGAGCGAG TATATATAGG ACTAAAAAAT GACGTAACGG TTAAAGTCCA CAAAAAACAC 
3720 

CCAGAAAACC GCACGCGAAC CTACGCCCAG AAACGAAAGC CAAAAAACCC ACAACTTCCT 
3780 

CAAATCGTCA CTTCCGTTTT CCCACGTTAC GTAACTTCCC GGATCCGCGG CATTCACAGT 
3840 
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tctc cgcaa= ««««- ercame. tggagtggtg aatccgttag cgagg^ccg 

3900 

CCGGCTTCCA TTCAGGTCGA """^ 
3960 

CAGACAAGGT ATAGGGCGGC GCCTACAATC CATGCCAACC CGTTCCATGT GCTCGCCGAG 



4020 



;?GAC gatcagcggt ccagtgatcg aagttaggct ggtaagagcc 



GCGGCATAAA TCGCCG 
4080 

GCGAGCGATC CTTGAAGCTG TCCCTGATGG TC 0 ™ CCTGCC^GA ™a«C 
4140 

T= CAACGCGG «« CGAT GCCGCCGGAA GCGAGAAGAA GAAGGCCATC 
4200 

CAGCCTCGCG TCGCGAACGC CAGCAAGACG TAGCCCAGCG CGTCGGCCGC CATGCCCGC 
4260 

TT CATCCCCG TGGCCCGTTG CTCGCGTTTG CTGGCGGTGT CCCCGGAAGA AATATATTTG 
4320 

p , r , urP rrcCCCKGCG TCTTGTCATT GGCGAATTCG 
CATGTCTTTA GTTCTATGAT GACACAAACC CCGCCCAGCG 

4380 

AACACGCAGA TGCAGTCGGG GCGGCGCGGT CCCAGGTCCA CTTCGCATAT TAAGGTGACG 
4440 

CGTGTGGCCT CGAACACCGA GCGACCCTGC AGCGACCCGC TTAAGAGCGT CAACAGCGTG 
4500 

CCGCAGATCC CGGGCAATGA GATATGAAAA AGCCTGAACT CACCGCGACG TCTGTCGAGA 
4560 

AGTTTCTGAT CGAAAAGTTC GACAGCGTCT CCGACCTGAT GCAGCTCTCG GAGGGCGAAG 
4620 
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AATCTCGTGC TTTCAGCTTC GATGTAGGAG GGCGTGGATA TGTCCTGCGG GTAAATAGCT 
4680 

GCGCCGATGG TTTCTACAAA GATCGTTATG TTTATCGGCA CTTTGCATCG GCCGCGCTCC 
4740 

CGATTCCGGA AGTGCTTGAC ATTGGGGAAT TCAGCGAGAG CCTGACCTAT TGCATCTCCC 
4800 

GCCGTGCACA GGGTGTCACG TTGCAAGACC TGCCTGAAAC CGAACTGCCC GCTGTTCTGC 
4860 

AGCCGGTCGC GGAGGCCATG GATGCGATCG CTGCGGCCGA TCTTAGCCAG ACGAGCGGGT 
4920 

TCGGCCCATT CGGACCGCAA GGAATCGGTC AATACACTAC ATGGCGTGAT TTCATATGCG 
4980 

CGATTGCTGA TCCCCATGTG TATCACTGGC AAACTGTGAT GGACGACACC GTCAGTGCGT 
5040 

CCGTCGCGCA GGCTCTCGAT GAGCTGATGC TTTGGGCCGA GGACTGCCCC GAAGTCCGGC 
5100 

ACCTCGTGCA CGCGGATTTC ' GGCTCCAACA ATGTCCTGAG GGACAATGGC CGCATAACAG 
5160 

CGGTCATTGA CTGGAGCGAG GCGATGTTCG GGGATTCCCA ATACGAGGTC GCCAACATCT 
5220 

TCTTCTGGAG GCCGTGGTTG GCTTGTATGG AGCAGCAGAC GCGCTACTTC GAGCGGAGGC 
5280 

ATCCGGAGCT TGCAGGATCG CCGCGGCTCC GGGCGTATAT GCTCCGCATT GGTCTTGACC 
5340 



AACTCTATCA GAGCTTGGTT GACGGCAATT TCGATGATGC AGCTTGGGCG CAGGGTCGAT 
5400 
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^rrrGGGA CTGTCGGGCG TACACAAATC GCCCGCAGAA 
GCGACGCAAT CGTCCGATCC GGAGCCGGGA CTG 

5460 

GCGCGGGCGT CTGGACCCAT GGCTGTGTAG AAGTACTCGC CGATAGTGGA AACCGACGCC 
5520 

CCAGCACTCG TCCGAGGGCA AAGGAATAGG GGAGATGGGG GAGGCTAAGT GAAACACGGA 
5580 

nn* arcCGCGCTA TGACGGCAAT AAAAAGACAG AATAAAACGC 
AGGAGACAAT ACCGGAAGGA ACCCGCGCTA IV, 

5640 

ACGGGTGTTG GGTCGTTTGT TCATAAACGC GGGGTTCGGT CCCAGGGCTG GCACTCTGTC 
5700 

GATACCGCAC CGAGACCCCA TTGGGGCCAA TACGCCCGCG TTTCTTCCTT TTCCCCACCC 
5760 

CACCCCGCAA GTTCGGGTGA AGGCCGAGGG CTGGCAGCGA AGGTCGGGGG GGCAGGCCCT 
5820 

r-r-r-* rrrr^Trrrr CATGGGGAAT GGTTTATGGT 
GCCATAGCCA CTGGCCCCGT GGGTTAGGGA CGGGGTCCCC CAT 

5880 

TCGTGGGGGT TA.TATTTTG GGCGTTGCGT GGGGXCTGGT CCACGAGTGG AGTGAGCAGA 
5940 

CAGACCCATG GTTTTTGGAT GGCCTGGGCA TGGACCGCAT GTACTGGCGC GACACGAACA 
6000 

CCGGGCGTCT GTGGCTGCCA AACACGCCCG ACCCCCAAAA ACCACCGCGG GGATTTCXGG 
6060 

n*nr*rCGG TCATGGCTGC GCCCCGACAC CCGCCAACAC CCGCTGACGC 
CGCCCAGTGC CGTCGACCGG TCATGULi^ 

6120 

GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA CCGTCTCCGG 
6180 
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GAGCTGCATG TGTCAGAGGT TTTCACCGTC ATCACCGAAA CGCGCGAGGC AGCCGGATCA 
6240 

TAATCAGCCA TACCACATTT GTAGAGGTTT TACTTGCTTT AAAAAACCTC CCCACCTCCC 
6300 

CCTGAACCTG AAACATAAAA TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA 
6360 

TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT 
6420 

GCATTCTAGT TGTGGTTTGT CCAAACTCAT CAATGTATCT TATCATGTCT GGATCCACTA 
6480 

GTTCTAGAGC GGCCGCCACC GCGGTGGAGC TCCAGCTTTT GTTCCCTTTA GTGAGGGTTA 
.6540 

ATTTCGAGCT TGGCGTAATC ATGGTCATAG CTGTTTCCTG TGTGAAATTG TTATCCGCTC 
6600 

ACAATTCCAC ACAACATACG AGCCGGAAGC AT AAAG TGT A AAGCCTGGGG TGCCTAATGA 
6660 

GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG CTTTCCAGTC GGGAAACCTG 
6720 

TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA GAGGCGGTTT GCGTATTGGG 
' 67 80 

CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG 
6840 

GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG AATCAGGGGA TAACGCAGGA 
6900 

AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC CGCGTTGCTG 
6960 
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GCGTTTTTCC ATAGGCTCCG CCGGGGTGAG GAGCATCACA AAAATCGACG CTCAAGTCAG 
7020 

AGGTGGCGAA ACGGGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG AAGC.CCC^ 

g1c T C T C CTGTTCCGAC GC.GGGGGT. ACCGGATACC T GTCGGGG TT TCTCCCTTCG 
7140 

GGAAGCGTGG CGG^GA TAGCTCACGC TGTAGGTATC TCAGTTCGGT GTAGGTCGTT 
7200 

CGCTCCAAGC TGGGCTGTGT GCACGAACCG CCCGTTCAGC CCGACCGCTG CGCCTTATCC 
7260 

„ATC GTCTTGAGTC CAACCCGG.A AGACACGACT TATCGCCACT GGGAGCAGGG 
7320 

ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT CTTGAAGTGG 
7380 

TGGCCTAACT AGGGGTAGAG TAGAAGGACA GTA.TTGGTA TC.GGGG.GX GG.GAAGGCA 
7440 

GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AAGAAAGGAC CGCTGGTAGC 
7500 

GGTGGTTTTX TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC TGAAGAAGAT 
7560 

CCTTTGATCT TTTCTACGGG GTCTGACGCT GAGTGGAAGG AAAAGTGAGG TTAAGGGATT 
7620 

TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC TTTTAAATTA AAAATGAAGT 
7680 

TTT AAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA ATGCTTAATC 
7740 
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AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT CCATAGTTGC CTGACTCCCC 
7800 

GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG GCGCCAGTGC TGCAATGATA 
7860 

CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC AGCCGGAAGG 
7920 

GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT TAATTGTTGC 
7980 

CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT TGCCATTGCT 
. 8040 

ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC CGGTTCCCAA 
8100 

CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT 
8160 

CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT TATGGCAGCA 
8220 

CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC TGGTGAGTAC 
8280 

TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG CCCGGCGTCA 
8340 

ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT TGGAAAACGT 
8400 - 

TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC GATGTAACCC 
8460 

ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC TGGGTGAGCA 
8520 
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«*rrr CGCAAAAAAG GGAATAAGGG CGACACGGAA ATGTTGAATA 
AAAACAGGAA GGCAAAATGC CGCAAAAAAG 

8580 

m ^ arr kTTTATC AGGGTTATTG TCTCATGAGC 
CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AG 

8640 

GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GCOTTCCCCC CACA^CCC 
8700 

CGAAAAGTGC 
8710 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

( C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(i i) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

ATGGGATCCA AGATGAAGCG CGCAAGACCG 
30 

(2) INFORMATION FOR SEQ ID NO: 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CATAACGCGG CCGCTTCTTT ATTCTTGGGC 
30 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7148 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

MOLECULE TYPE: DNA (genomic) 
HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
60 

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
120 

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
180 



(ii). 
(iii)' 
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TTAGGGTTAG GCGTTTTGCG 



CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 



240 



****** TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
GATTATTGAC TAGTTATTAA TAGTAATCAA 

300 

TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
360 

CGCGGCGATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCGAATA GGGACTTTCC 
420 

ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
480 

ATCATATGCC AAGTACGCCC GCTATTGACG TCAATGAGGG TAAATGCGCG GCCTGGCATT 
540 

ATGCCCAGTA GATGAGCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 
600 

TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTAGATCAA TGGGCGTGGA TAGCGGTTTG 
660 

ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
720 

AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC CCCATTGACG CAAATGGGCG 
780 

GTAGGCGTGT ACGGTGGGAG GTCTATATAA GGAGAGCTGT CTGGCTAACT AGAGAAGCGA 
840 

CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG GGAGAGGGAA GCTTGGTACC 
900 

GAGCTCGGAT CCAAGATGAA GCGCGCAAGA CCGTCTGAAG ATACCTTCAA CCCCGTGTAT 



960 
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CCATATGACA CGGAAACCGG TCCTCCAACT GTGCCTTTTC TTACTCCTCC CTTTGTATCC 
1020 

CCCAATGGGT TTCAAGAGAG TCCCCCTGGG GTACTCTCTT TGCGCCTATC CGAACCTCTA 
1080 

GTTACCTCCA ATGGCATGCT TGCGCTCAAA ATGGGCAACG GCCTCTCTCT GGACGAGGCC 
1140 

GGCAACCTTA CCTCCCAAAA TGTAACCACT GTGAGCCCAC CTCTCAAAAA AACCAAGTCA 
1200 



TGGAAATATC TGCACCCCTC ACAGTTACCT CAGAAGCCCT AACTGTGGCT 
GCCGCCGCAC CTCTAATGGT CGCGGGCAAC ACACTCACCA TGCAATCACA GGCCCCGCTA 



AACATAAACC 
1260 



1320 



ACCGTGCACG ACTCCAAACT TAGCATTGCC ACCCAAGGAC CCCTCACAGT GTCAGAAGGA 
1380 

AAGCTAGCCC TGCAAACATC AGGCCCCCTC ACCACCACCG ATAGCAGTAC CCTTACTATC 
1440 

ACTGCCTCAC CCCCTCTAAC TACTGCCACT GGTAGCTTGG GCATTGACTT GAAAGAGCCC 
1500 

ATTTATACAC AAAATGGAAA ACTAGGACTA* AAGTACGGGG CTCCTTTGCA TGTAACAGAC 
1560 

GACCTAAACA CTTTGACCGT AGCAACTGGT CCAGGTGTGA CTATTAATAA TACTTCCTTG 
1620 

CAAACTAAAG TTACTGGAGC CTTGGGTTTT GATTCACAAG GCAATATGCA ACTTAATGTA 
1680 



GCAGGAGGAC TAAGGATTGA TTCTCAAAAC AGACGCCTTA TACTTGATGT TAGTTATCCG 
1740 • 
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mWSCTC AAAACGAACT AAATGTAAGA CTAGGACAGG GCCCTCTTTT TATAAACTCA 



1800 



CCCCACAACT TGGATATTAA CTACAACAAA GGCCTTTACT TGTTTACAGC TTCAAACAAT 



1860 



TCCAAAAAGC TTGAGGTTAA CCTAAGCACT GCCAAGGGGT TGATGTTTGA GGGTACAGCG 
1920 

ATAGCCATTA ATGCAGGAGA TGGGCTTGAA TTTGGTTCAC CTAATGCACC AAACACAAAT 
1980 

CCCCTCAAAA CAAAAATTGG CCATGGCCTA GAATTTGATT CAAACAAGGC TATGGTTCCT 
2040 

AAACTAGGAA CTGGCCTTAG TTTTGACAGC ACAGGTGCCA TTACAGTAGG AAACAAAAAT 
2100 

AATGATAAGC TAACTTTGTG GACCACACCA GCTCCATCTC CTAACTGTAG ACTAAATGCA 
2160 

GAGAAAGATG CTAAACTCAC TTTGGTCTTA ACAAAATGTG GCAGTCAAAT ACTTGCTACA 
2220 

GTTTCAGTTT TGGCTGTTAA AGGCAGTTTG GCTCCAATAT CTGGAACAGT TCAAAGTGCT 
2280 

CATCTTATTA TAAGATTTGA CGAAAATGGA GTGCTACTAA ACAATTCCTT CCTGGACCCA 
2340 

GAATATTGGA ACTTTAGAAA TGGAGATCTT ACTGAAGGCA CAGCCTATAC AAACGCTGTT 
2400 

GGATTTATGC CTAACCTATC AGCTTATCCA AAATCTCACG GTAAAACTGC GAAAAGTAAG 
2460 

~™ *iirrr*.r*C AAAACTAAAC CTGTAACACT AACCATTACA 
ATTGTCAGTC AAGTTTACTT AAACGGAGAC AAAA<-i/w*_ 

2520 
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CTAAACGGTA CACAGGAAAC AGGAGACACA ACTCCAAGTG CATACTCTAT GTCATTTTCA 
2580 

TGGGACTGGT CTGGCCACAA CTACATTAAT GAAATATTTG CCACATCCTC TTACACTTTT 
2640 

TCATACATTG CCCAAGAATA AAGAAGCGGC CGCTCGAGCA TGCATCTAGA GGGCCCTATT 
2700 

CTATAGTGTC ACCTAAATGC TAGAGCTCGC TGATCAGCCT CGACTGTGCC TTCTAGTTGC 
2760 

CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG TGCCACTCCC 
2820 

ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG GTGTCATTCT 
2880 

ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA CAATAGCAGG 
2940 

CATGCTGGGG ATGCGGTGGG CTCTATGGCT TCTGAGGCGG AAAGAACCAG CTGGGGCTCT 
3000 

AGGGGGTATC CCCACGCGCC CTGTAGCGGC GCATTAAGCG CGGCGGGTGT GGTGGTTACG 
3060 

CGCAGCGTGA CCGCTACACT TGCCAGCGCC CTAGCGCCCG CTCCTTTCGC TTTCTTCCCT 
3120 

TCCTTTCTCG CCACGTTCGC CGGCTTTCCC CGTCAAGCTC TAAATCGGGG CATCCCTTTA 
3180 

GGGTTCCGAT TTAGTGCTTT ACGGCACCTC GACCCCAAAA AACTTGATTA GGGTGATGGT 
3240 

TCACGTAGTG GGCCATCGCC CTGATAGACG GTTTTTCGCC CTTTGACGTT GGAGTCCACG 
3300 
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TTCTTTAATA —AAAC, GGAAGAACAC ««CC« -GGTCTA, 

3360 

TCTTTTGATT TATAAGGGAT TTTGGGGATT TCGGCCTATT GGTTAAAAAA TGAGCTGATT 
3420 

TAACAAAAAT TTAACGCGAA TTAATTCTGT GGAATGTGTG TCAGTTAGGG TGTGGAAAGT 
3480 

CCCCAGGCTC CCCAGGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA GTCAGCAACC 
3540 

AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT GCATCTCAAT 
3600 

TAGTCAGCAA CCATAGTCCC GCCCCTAACT CCGCCCATGC CGCCCGTAAG TCCGCCCAGT 
3660 

TOT CCCATT CTCCGCCCCA TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC 
3720 

OCCTCTGCCT CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG CCTAGGCTTT 
3780 

TGCAAAAAGC TCCCGGGAGC TTGTATATCC ATTTTCGGAT CTGATCAAGA GACAGGATGA 
3840 

GGATCGTTTC GCATGATTGA ACAAGATGGA TTGCACGCAG GTTCTCCGGC CGCTTGGGTG 
3900 

GAGAGGCTAT TCGGCTATGA CTGGGCACAA CAGACAATCG GCTGCTCTGA TCCCGCCGTG 
3960 

TTCCGGCTGT CAGCGCAGGG GCGCCCGGTT CTTTTTGTCA AGACCGACCT GTCGGG^CG 



4020 

CTGAATGAAC TGCAGGACGA 
4080 



GGCAGCGCGG CTATCGTGGC TGGCCACGAC GGGCGTTCCT 
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TGCGCAGCTG TGCTCGACGT TGTCACTGAA GCGGGAAGGG ACTGGCTGCT ATTGGGCGAA 
4140 

GTGCCGGGGC AGGATCTCCT GTCATCTCAC CTTGCTCCTG CCGAGAAAGT ATCCATCATG 
4200 

GCTGATGCAA TGCGGCGGCT GCATACGCTT GATCCGGCTA CCTGCCCATT CGACCACCAA 
4260 

GCGAAACATC GCATCGAGCG AGCACGTACT CGGATGGAAG CCGGTCTTGT CGATCAGGAT 
4320 

GATCTGGACG AAGAGCATCA GGGGCTCGCG CCAGCCGAAC TGTTCGCCAG GCTCAAGGCG 
4380 

CGCATGCCCG ACGGCGAGGA TCTCGTCGTG ACCCATGGCG ATGCCTGCTT GCCGAATATC 
4440 

ATGGTGGAAA ATGGCCGCTT TTCTGGATTC ATCGACTGTG GCCGGCTGGG TGTGGCGGAC 
4500 

CGCTATCAGG ACATAGCGTT GGCTACCCGT GATATTGCTG AAGAGCTTGG CGGCGAATGG 
4560 

GCTGACCGCT TCCTCGTGCT TTACGGTATC GCCGCTCCCG ATTCGCAGCG CATCGCCTTC 
4620 

TATCGCCTTC TTGACGAGTT CTTCTGAGCG GGACTCTGGG GTTCGAAATG ACCGACCAAG 
4680 

CGACGCCCAA CCTGCCATCA CGAGATTTCG ATTCCACCGC CGCCTTCTAT GAAAGGTTGG 
4740 

GCTTCGGAAT CGTTTTCCGG GACGCCGGCT GGATGATCCT CCAGCGCGGG GATCTCATGC 
4800 

TGGAGTTCTT CGCCCACCCC AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 
4860 
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ATAGCATCAC AAATTTCACA 



AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT 



4920 



CCAAAGTCAT CAATGTATCT TATCATGTCT GTATACCGTC GACGTCTAGC TAGACGTTGG 



4980 



CGTAATCATG GTCATAGCTG TTTCCTGTGT GAAATTGTTA TGCGCTGAGA ATTCCACACA 



5040 



ACATACGAGC CGGAAGCATA AAGTGTAAAG CCTGGGGTGC CTAATGAGTG AGCTAACTCA 
5100 

CATTAATTGC GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG AAACCTGTCG TGCCAGCTGC 
5160 

ATTAATGAAT CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGGCGC TCTTCCGCTT 
5220 

CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GGGAGCGGTA TCAGCTCACT 



5280 



CAAAGGCGGT AATACGGTTA TCGACAGAAT CAGGGGATAA CGCAGGAAAG AACATGTGAG 
5340 

CAAAAGGCCA GCAAAAGGCC AGGAACCGTA AAAAGGCCGC GTTGCTGGCG TTTTTCCATA 
5400 

GG CTCCGCCC CCCTGACGAG CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC 
5460 

mrrcCTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG 
CGACAGGACT ATAAAGATAC CAGGCGTTTC CUU^ 

5520 

TTCCGACCCT GCCGCTTAGC GGATACCTGT CCGCCTTTCT CCCTTCGGGA AGCGTGGCGC 
5580 

TTTCTCAATG CTCACGCTGT AGGTATCTCA GTTCGGTGTA GGTCGTTCGC TCCAAGCTGG 



5640 
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GCTGTGTGCA CGAACCCCCC GTTCAGCCCG ACCGCTGCGC CTTATCCGGT AACTATCGTC 
5700 

TTGAGTCCAA CCCGGTAAGA CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA 
5760 

TTAGCAGAGC GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG- CCTAACTACG 1 
5820 

GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT ACCTTCGGAA 
5880 

AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC TGGTAGCGGT GGTTTTTTTG 
5940 

TTTGCAAGCA GCAGATTACG CGCAGAAAAA AAGGATCTCA AGAAGATCCT TTGATCTTTT 
6000 

CTACGGGGTC TGACGCTCAG TGG AAC G AAA ACTCACGTTA AGGGATTTTG GTCATGAGAT 
6060 

TATCAAAAAG GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 
6120 

AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA 
. 6180 

TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC GTGTAGATAA 
6240 

CTACGATACG GGAGGGCTTA CCATCTGGCC CCAGTGCTGC AATGATACCG CGAGACCCAC 
6300 

GCTCACCGGC TCCAGATTTA TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA 
6360 



GTGGTCCTGC AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG 
6420 
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TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC GATTGCTACA GGCATCGTGG 
6480 

m „™ ^rrr-TTCKT TCAGCTCCGG TTCCCAACGA TCAAGGCGAG 
TGTCACGCTC GTCGTTTGGT ATGGCTTCAT 

6540 

...»»&r rr rTTAGCTC CTTCGGTCCT CCGATCGTTG 
TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC CTl 

6600 

TCAGAAGTAA GTTGGCCGCA GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC 
6660 

TTACTGTCAT GCCATCCGTA AGATGCTTTT CTGTGAGTGG TGAGTACTCA ACCAAGTCAT 
6720 

TCTGAGAATA GTGTATGCGG GGACCGAGTT GCTCTTGCCC GGCGTCAATA CGGGATAATA 
6780 

~ „ mlll rTrr TCATCATTGG AAAACGTTCT TCGGGGCGAA 
CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCAT1W, 

6840 

AACTCTCAAG GATCTTACCG CTGTTGAGAT GCAGTTGGAT GTAACCCACT GGTGCACCCA 
6900 

ACTGATCTTC AGCATCTTTT ACTTTCAGGA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC 
6960 

^OGCGC AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC 
7020 

TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA TACATATTTG 
7080 

CAAATAGGGG TTCCGCGCAC ATTTCCCCGA AAAGTGCCAC 



AATGTATTTA GAAAAATAAA 
7140 

CTGACGTC 
7148 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GACGGATCGG GAGATCTCCC GATCCCCTAT GGTCGACTCT CAGTACAATC TGCTCTGATG 
60 

CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT GGAGGTCGCT GAGTAGTGCG 
120 

CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA CAATTGCATG AAGAATCTGC 
180 

TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC CAGATATACG CGTTGACATT 
240 

GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
300 

TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC TGGCTGACCG CCCAACGACC 
360 

CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT AACGCCAATA GGGACTTTCC 
420 



» 1 
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TATTTACGGT AAACTGCCCA CTTGGCAGTA CATCAAGTGT 
ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG TAAATGGCCC GCCTGGCATT 



ATTGACGTCA ATGGGTGGAC 
480 



540 



ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA GTACATCTAC GTATTAGTCA 



600 



TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA TAGCGGTTTG 
660 

ACTCACGGGG ATTTCCAAGT CTGCACCCGA TTGACGTCAA TGGGAGTTTG TTTTGGCACC 
720 

AAAATCAACG GGACTTTCCA AAATGTCGTA AGAACTGGGC CCCATTGACG CAAATGGGCG 
780 

GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT CTGGCTAACT AGAGAACCCA 
840 

CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG GGAGACCCAA GCTTGGTACC 
900 

GAGCTCGGAT CTGAATTCGA GCTCGCTGTT GGGCTCGCGG TTGAGGACAA ACTCTTCGCG 
960 

GTCTTTCCAG TACTCTTGGA TCGGAAACCC GTCGGCCTCC GAACGGTACT CCGCCACCGA 
1020 

GGGACCTGAG CGAGTCCGCA TCGACCGGAT CGGAAAACCT CTCGAGAAAG GCGTCTAACC 
1080 

AGTCACAGTC GCAAGGTAGG CTGAGCACCG TGGCGGGCGG CAGCGGGTGG CGGTCGGGGT 



1140 

TGTTTCTGGC 
1200 



GGAGGTGCTG CTGATGATGT AATTAAAGTA GGCGGTCTTG AGACGGCGGA 
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TGGTCGAGGT GAGGTGTGGC AGGCTTGAGA TCCAAGATGA AGCGCGCAAG ACCGTCTGAA 
1260 

GATACCTTCA ACCCCGTGTA TCCATATGAC ACGGAAACCG GTCCTCCAAC TGTGCCTTTT 
1320 

CTTACTCCTC CCTTTGTATC CCCCAATGGG TTTCAAGAGA GTCCCCCTGG GGTACTCTCT 
1380 

TTGCGCCTAT CCGAACCTCT AGTTACCTCC AATGGCATGC TTGCGCTCAA AATGGGCAAC 
1440 

GGCCTCTCTC TGGACGAGGC CGGCAACCTT ACCTCCCAAA ATGTAACCAC TGTGAGGCCA 
1500 

CCTCTCAAAA AAACCAAGTC AAACATAAAC CTGGAAATAT CTGCACCCCT CACAGTTACC 
1560 

TCAGAAGCCC TAACTGTGGC TGCCGCCGCA CCTCTAATGG TCGCGGGCAA CACACTCACC 
1620 

ATGCAATCAC AGGCCCCGCT AACCGTGCAC GACTCCAAAC TTAGCATTGC CACCCAAGGA 
1680 

CCCCTCACAG TGTCAGAAGG AAAGCTAGCC CTGCAAACAT CAGGCCCCCT CACCACCACC 
1740 

GATAGCAGTA CCCTTACTAT CACTGCCTCA CCCCCTCTAA CTACTGCCAC TGGTAGCTTG 
1800 

GGCATTGACT TGAAAGAGCC CATTTATACA CAAAATGGAA AACTAGGACT AAAGTACGGG 
1860 

GCTCCTTTGC ATGTAACAGA CGACCTAAAC ACTTTGACCG TAGCAACTGG TCCAGGTGTG 
1920 

ACTATTAATA ATACTTCCTT GCAAACTAAA GTTACTGGAG CCTTGGGTTT TGATTCACAA 
1980 
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«~p i rr AGGAGGA CTAAGGATTG ATTCTCAAAA CAGACGCCTT 
GGCAATATGC AACTTAATGT AGCAGGAGGA CI 

2040 

ATACTTGATG TTAGTTATCC GTTTGATGCT CAAAACCAAC TAAATCTAAG ACTAGGACAG 
2100 

GGCCCTCTTT TTATAAACTC AGGCGAGAAC TTGGATATTA ACTACAACAA AGGCCTTTAC 
2160 

XTGTTTACAG CTTCAAACAA TTCCAAAAAG CTTGAGGTTA ACCTAAGCAC TGCGAAGGGG 
2220 

TTGATGTTTG ACGCTACAGC CATAGCCATT AATGCAGGAG ATGGGCTTGA ATTTGGTTCA 
2280 

CCTAATGCAC CAAACACAAA TCCCCTCAAA ACAAAAATTG GCCATGGCCT AGAATTTGAT 
2340 

TCAAACAAGG CTATGGTTCC TAAACTAGGA ACTGGCCTTA GTTTTGACAG CACAGGTGCC 
2400 

ATTACAGTAG GAAACAAAAA TAATGATAAG CTAACTTTGT GGACCACACC AGCTCCATCT 
2460 



CCTAACTGTA GACTAAATGC AGAGAAAGAT GGTAAACTCA CTTTGGTCTT AACAAAATGT 
2520 

GGCAGTCAAA TACTTGCTAC AGTTTCAGTT TTGGCTGTTA AAGGCAGTTT GGCTCCAATA 
2580 

TCTGGAACAG TTCAAAGTGC TCATCTTATT ATAAGATTTG ACGAAAATGG AGTGCTACTA 
2640 

AACAATTCCT TCCTGGACCC AGAATATTGG AACTTTAGAA ATGGAGATCT TACTGAAGGC 
2700 

ACAGCCTATA CAAACGCTGT TGGATTTATG CC.AACCTAT CAGCTTATCC AAAATCTCAC 



2760 
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GGTAAAACTG CCAAAAGTAA CATTGTCAGT CAAGTTTACT TAAACGGAGA CAAAACTAAA 
2820 

CCTGTAACAC TAACCATTAC ACTAAACGGT ACACAGGAAA CAGGAGACAC AACTCCAAGT 
2880 

GCATACTCTA TGTCATTTTC ATGGGACTGG TCTGGCCACA AC TAG ATT AA TGAAATATTT 
2940 

GCCACATCCT CTTACACTTT TTCATACATT GCCCAAGAAT AAAGAAGCGG CCGCTCGAGC 
3000 

ATGCATCTAG AGGGCCCTAT TCTATAGTGT CACCTAAATG CTAGAGCTCG CTGATCAGCC 
3060 

TCGACTGTGC CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG 
3120 

ACCCTGGAAG GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT 
3180 

TGTCTGAGTA GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG 
3240 

GATTGGGAAG ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGC TTCTGAGGCG 
3300 

GAAAGAACCA GCTGGGGCTC TAGGGGGTAT CCCCACGCGC CCTGTAGCGG CGCATTAAGC 
3360 

GCGGCGGGTG TGGTGGTTAC GCGCAGCGTG ACCGCTACAC TTGCCAGCGC CCTAGCGCCC 
3420 

GCTCCTTTCG CTTTCTTCCC TTCCTTTCTC GCCACGTTCG CCGGCTTTCC CCGTCAAGCT 
3480 

CTAAATCGGG GCATCCCTTT AGGGTTCCGA TTTAGTGCTT TACGGCACCT CGACCCCAAA 
3540 
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„AT, AGGGXGA.GG TTCACGTAG* GGGCCATCGC CCGATAGAG GGTTTTTCGC 
3600 

Ccm »CGT TGGAGTCCAC GTTCTTTAAT AGTGGACTCT TGTTCCAAAC TGGAACAACA 
3660 

CTCAACCCTA TCTCGGTCTA TTCTTTTGAT TTATAAGGGA TTTTGGGGAT TTCGGCCTAT 
3720 

TGGTTAAAAA ATGAGCTGAT TTAACAAAAA TTTAACGCGA ATTAATTCTG TGGAATGTGT 
3780 

GTCAGTTAGG GTGTGGAAAG TCCCCAGGCT CCCCAGGCAG GCAGAAGTAT GCAAAGCATG 
3840 

CATCTCAATT AGTCAGCAAC CAGGTGTGGA AAGTGCCCAG GCTCCCCAGC AGGCAGAAGT 
3900 

ATGCAAAGCA TGCATCTCAA TTAGTCAGCA ACCATAGTCC CGCGCGTAAC TGCGGCGATC 
3960 

CCGCCCCTAA CTCGGCCCAG TTCGGCGCAT TCTGCGGGGC ATGGCTGACT AATTTTTTTT 
4020 

ATTTATGCAG AGGCCGAGGC CGCCTGTGCC TCTGAGCTAT TCCAGAAGTA GTGAGGAGGC 
4080 

TTTTTTGGAG GCCTAGGCTT TTGCAAAAAG CTCCCGGGAG CTTGTATATC CATTTTCGGA 
4140 

AGGATCGTTT CGCATGATTG AACAAGATGG ATTGCACGCA 



TCTGATCAAG AGACAGGATG 
4200 



GGTTCTCCGG CCGCTTGGGT GGAGAGGCTA TTCGGCTATG ACTGGGGAGA AGAGACAATG 



4260 



GGCTGCTCTG ATGCCGCCGT GTTCCGGCTG TGAGCGCAGG GGCGCCCGGT TCTTITTGTC 



4320 
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AAGACCGACC TGTCCGGTGC CCTGAATGAA CTGCAGGACG AGGCAGCGCG GCTATCGTGG 
4380 

CTGGCCACGA CGGGCGTTCC TTGCGCAGCT GTGCTCGACG TTGTCACTGA AGCGGGAAGG 
4440 

GACTGGCTGC TATTGGGCGA AGTGCCGGGG CAGGATCTCC TGTCATCTCA CCTTGCTCCT 
4500 

GCCGAGAAAG TATCCATCAT GGCTGATGCA ATGCGGCGGC TGCATACGCT TGATCCGGCT 
4560 

ACCTGCCCAT TCGACCACCA AGCGAAACAT CGCATCGAGC GAGCACGTAC TCGGATGGAA 
4620 

GCCGGTCTTG TCGATCAGGA TGATCTGGAC GAAGAGCATC AGGGGCTCGC GCCAGCCGAA 
4680 

CTGTTCGCCA GGCTCAAGGC GCGCATGCCC GACGGCGAGG ATCTCGTCGT GACCCATGGC 
4740 

GATGCCTGCT TGCCGAATAT CATGGTGGAA AATGGCCGCT TTTCTGGATT CATCGACTGT 
4800 

GGCCGGCTGG GTGTGGCGGA CCGCTATCAG GACATAGCGT- TGGCTACCCG TGATATTGCT 
4860 

GAAGAGCTTG GCGGCGAATG GGCTGACCGC TTCCTCGTGC TTTACGGTAT CGCCGCTCCC 
4920 

GATTCGCAGC GCATCGCCTT CTATCGCCTT CTTGACGAGT TCTTCTGAGC GGGACTCTGG 
4980 

' GGTTCGAAAT GACCGACCAA GCGACGCCCA ACCTGCCATC ACGAGATTTC GATTCCACCG 
5040 

CCGCCTTCTA TGAAAGGTTG GGCTTCGGAA TCGTTTTCCG GGACGCCGGC TGGATGATCC 
5100 
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TC CAGCGGG= GGATCTCATG CTGGAGTTCT TCGCGCACCG CAACTTGTTT ATTGCAGCTT 
5160 

atsat gg™ caaataaagc aataggatga gaaat^ac aaataaagca ttt™ 

5220 

TGCATTCTAG TTGTGGTTTG TCCAAACTCA TCAATGTATC TTATCATGTC TGTATACCGT 
5280 

CGACCTCTAG CTAGAGCTTG GCGTAATCAT GGTCATAGCT GTTTCCTGTG TGAAATTGTT 
5340 

ATCCGCTCAC AATTCCACAC AACATACGAG CCGGAAGCAT AAAGTGTAAA GCCTGGGGTC 
5400 

CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTCCGCTC ACTGCCCGCT TTCCAGTCGG 
5460 



GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG CGCGGGGAGA GGCGGTTTGC 
5520 

GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT GCGCTCGGTC GTTCGGCTGC 
5580 

„ n,nmrrrrc TAATACGGTT ATCCACAGAA TCAGGGGATA 
GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACW.il * 

5640 

ACGCAGGAAA GAACATGTGA GCAAAAGGCG AGCAAAAGGC CAGGAACCGT AAAAAGGCGG 
5700 

CGTTGCTGGC GTTTTTGGAT AGGCTCCGCC CCCCTGAGGA GCATCACAAA AATCGACGCT 
5760 

CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA 
5820 

GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG TCCGCCTTTC 



5880 
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TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG TAGGTATCTC AGTTCGGTGT 
5940 

AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC GACCGCTGCG 
6000 

CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG 
6060 

CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT 
6120 

TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC TGCGCTCTGC 
6180 

TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA CAAACCACCG 
6240 . 

CTGGTAGCGG TGGTTTTTT.T GTTTGCAAGC AGCAGATTAC GCGCAGAAAA AAAGGATCTC 
6300 

AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT 
63 60 

AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA 
6420 

AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC AGTTACCAAT 
6480 

GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCGTTCATCC ATAGTTGCCT 
6540 

GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC CCCAGTGCTG 
6600 

CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGCAATA AACCAGCCAG 
6660 
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CO.— C =GAG==CAGA CTG CGCCTCCATC CAGTCTATTA 

6720 

ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC AACGTTGTTC 
6780 

CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA TTCAGCTCCG 
6840 

GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA GCGGTTAGCT 
6900 

CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA CTCATGGTTA 
6960 

TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT TCTGTGACTG 
7020 

GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG GCGACCGAGT TGCTCTTGCC 



7080 



ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG CTCATCATTG 



CGGCGTCAAT ACGGGATAAT 
7140 

«~m~mr*%* rr^TrTTACC GCTGTTGAGA TCCAGTTCGA 
GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC 

7200 

TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC AGCGTTTCTG 
7260 

^ , 1111T rrrG CAAAAAAGGG AATAAGGGCG ACACGGAAAT 
GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAA*^ 

7320 

GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG CATTTATCAG GGTTATTGTC 
7380 

TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGO GTTCCGCGCA 
7440 
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- 95 - 



CATTTCCCCG AAAAGTGCCA CCTGACGTC 
7469 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TGCTTAAGCG GCCGCGAAGG AGAAGTCC 
28 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCGAGCTAGC GACTGAAAAT GAG 



23 



(2 ) INFORMATION FOR SEQ ID NO:U: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(i i) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi J SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CCTCTCGAGA GACAGCAAGA CAC 
23 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

AAGCTTGGGC AGAAATGGTT GAACTCCCGA GAGTGTCCTA CACCTAGGGG AGAAGCAGCC 
60 

AAGGGGTTGT TTCCCACCAA GGACGACCCG TCTGCGCACA AACGGATGAG CCCATCAGAC 
120 

AAAGACATAT TCATTCTCTG CTGCAAACTt GGCATAGCTC TGCTTTGCCT GGGGCTATTG 
180 

GGGGAAGTTG CGGTTCGTGC TCGCAGGGCT CTCACCCTTG ACTCTTTTAA TAGCTCTTCT 
240 

GTGCAAGATT ACAATCTAAA CAATTCGGAG AACTCGACCT TCCTCCTGAG GCAAGGACCA 
300 

CAGCCAACTT CCTCTTACAA GCCGCATCGA TTTTGTCCTT CAGAAATAGA AATAAGAATG 

•3 60 

CTTGCTAAAA ATTATATTTT TACCAATAAG ACCAATCCAA TAGGTAGATT ATTAGTTACT 
420 

ATGTTAAGAA ATGAATCATT ATCTTTTAGT ACTATTTTTA CTCAAATTCA GAAGTTAGAA 
480 

ATGGGAATAG AAAATAGAAA GAGACGCTCA ACCTCAATTG AAGAACAGGT GCAAGGACTA 
540 

TTGACCACAG GCCTAGAAGT AAAAAAGGGA AAAAAGAGTG TTTTTGTCAA AATAGGAGAC 
600 

AGGTGGTGGC AACCAGGGAC TTATAGGGGA CCTTACATCT ACAGACCAAC AGATGCCCCC 
660 

TT AC CAT AT A CAGGAAGATA TGACTTAAAT TGGGATAGGT GGGTTACAGT CAATGGCTAT 
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r ^ rrpT rrcTTTTCGT GAAAGACTCG CCAGAGCTAG ACCTCCTTGG 
AAAGTGTTAT ATAGATCCCT CCCTTTTC^i 

780 

TGTATGTTGT GTCAAGAAGA AAAAGAGGAG ™ C AGGTACATGA TTATATTTAT 

840 

CTAGGAACAG GAATGCACTT TTGGGGAAAG ATTTTCCATA CCAAGGAGGG GACAGTGGCT 



900 



GGACTAATAG AACATTATTC TGGAAAAACT CATGGGATGA GTTATTATGA ATAGCCTTTA 
960 

TTGGGGGAAG CTTGCGGTTC GGAGGGGTTA AGTAAGTTTT TGGTTACAAA CTGTTCTTAA 
1020 

AACGAGGATG TGAGACAAGT GGTTTCCTGA CTTGGTTTGG TATCAAAGGT TCTGATCTGA 
1080 

GCTCTGAGTG TTCTATTTTC CTATGTTCTT TTGGAATTTA TCCAAATCTT ATGTAAATGC 
1140 

TTATGTAAAC CAAGATATAA AAGAGTGCTG ATTTTTTGAG TAAACTTGCA ACAGTCCTAA 
1200 

CATTCACCTC TTGTGTGTTT GTGTCTGTTC GCCATCCCGT CTCCGCTCGT CACTTATCCT 
1260 

TCACTTTCCA GAGGGTCCCC CCGCAGACCC CGGCGACCCT CAGGTCGGCC GACTGCGGCA 
1320 

GCTGGCGCCC GAACAGGGAC CCTCGGATAA GTGACCCTTG TCTCTATTTC TACTATTTGG 



1380 

TGTTTGTCTT 
1440 



GTATTGTCTC TTTCTTGTCT GGCTATCATC ACAAGAGCGG AACGGACTCA 



CCATAGGGAC CAAGCTAGCG ACTGAAAATG AGACATATTA TCTGCCACGG AGGTGTTATT 
1500 



WO 98/13499 
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-99- 

ACCGAAGAAA TGGCCGCCAG TCTTTTGGAC CAGCTGATCG AAGAGGTACT GGCTGATAAT 
1560 

CTTCCACCTC CTAGCCATTT TGAACCACCT ACCCTTCACG AACTGTATGA TTTAGACGTG 
1620 

ACGGCCCCCG AAGATCCCAA CGAGGAGGCG GTTTCGCAGA TTTTTC.CCGA CTCTGTAATG 
1680 

' TTGGCGGTGC AGGAAGGGAT TGACTTACTC ACTTTTCCGC CGGCGCCCGG TTCTCCGGAG 
1740 

CCGCCTCACC TTTCCCGGCA GCCCGAGCAG CCGGAGCAGA GAGCCTTGGG TCCGGTTTCT 
1800 

ATGCCAAACC TTGTACCGGA GGTGATCGAT CTTACCTGCC ACGAGGCTGG CTTTCCACCC 
1860 

AGTGACGACG AGGATGAAGA GGGTGAGGAG TTTGTGTTAG ATTATGTGGA GCACCCCGGG 
1920 

CACGGTTGCA GGTCTTGTCA TTATCACCGG AGGAATACGG GGGACCCAGA TATTATGTGT 
1980 

TCCSCTTTGCT ATATGAGGAC CTGTGGCATG TTTGTCTACA GTAAGTGAAA ATTATGGGCA 
2040 

GTGGGTGATA GAGTGGTGGG TTTGGTGTGG TAATTTTTTT TTTAATTTTT ACAGTTTTGT 
2100 

GGTTTAAAGA ATTTTGTATT GTGATTTTTT TAAAAGGTCC TGTGTCTGAA CCTGAGCCTG 
2160 

AGCCCGAGCC AGAACCGGAG CCTGCAAGAC .CTACCCGCCG TCCTAAAATG GCGCCTGCTA 
2220 

TCCTGAGACG CCCGACATCA CCTGTGTCTA GAGAATGCAA TAGTAGTACG GATAGCTGTG 
2280 



WO 98/13499 



-100- 
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m _. rl TACACCCGGT GGTCCCGCTG TGCCCCATTA 
ACTCCGGTCC TTCTAACACA CCTCCTGAGA TACACCCGGI 

2340 

AACCAGTTGC CGTGAGAGTT GGTGGGCGTC GCCAGGCTGT GGAATGTATC GAGGACTTGC 
2400 

TTAACGAGCC TGGGCAACCT TTGGACTTGA GCTGTAAACG GCGGAGGCCA TAAGGTGTAA 
2460 

ACCTGTGATT GCGTGTGTGG TTAACGCCTT TGTTTGCTGA ATGAGTTGAT GTAAGTTTAA 
2520 

TAAAGGGTGA GATAATGTTT AACTTGCATG GCGTGTTAAA TGGGGCGGGG CTTAAAGGGT 
2580 

ATATAATGCG GCGTGGGCTA ATCTTGGTTA CATCTGACCT CATGGAGGCT TGGGAGTGTT 
2640 

TGGAAGATTT TTCTGCTGTG CGTAACTTGC TGGAAGAGAG CTCTAACAGT ACCTCTTGGT 
2700 

TTTGGAGGTT TCTGTGGGGC TCATCCCAGG CAAAGTTAGT CTGCAGAATT AAGGAGGATT 
2760 

ACAAGTGGGA ATTTGAAGAG CTTTTGAAAT CCTGTGGTGA GCTGTTTGAT TCTTTGAATC 
2820 

TGGGTCACCA GGCGCTTTTC CAAGAGAAGG TCATCAAGAC TTTGGATTTT TCCACACCGG 
2880 

OOCGCGCTGC GGCTGCTGTT GCTTTTTTGA GTTTTATAAA GGATAAATGG AGCGAAGAAA 
2940 

CCCATCTGAG CGGGGGGTAC CTGCTGGATT TTCTGGCCAT GCATCTGTGG AGAGCGGTTG 
3000 

TGAGACACAA GAATCGCCTG CTACTGTTGT CTTCCGTCCG CCCGGCGATA ATACCGACGG 



3060 



WO 98/13499 



-101 - 
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AGGAGCAGCA GCAGCAGCAG GAGGAAGCCA GGGGGCGGCG GCAGGAGCAG AGCCCATGGA 
3120 

ACCCGAGAGC CGGCCTGGAC CCTCGGGAAT GAATGTTGTA CAGGTGGCTG AACTGTATCC 
3180 

AGAACTGAGA CGCATTTTGA CAATTACAGA GGATGGGCAG GGGCTAAAGG GGGTAAAGAG 
3240 

GGAGCGGGGG GCTTGTGAGG CTACAGAGGA GGCTAGGAAT CTAGCTTTTA GCTTAATGAC 
3300 

CAGACACCGT CCTGAGTGTA TTACTTTTCA ACAGATCAAG GATAATTGCG CTAATGAGCT 
3360 

TGATCTGCTG GCGCAGAAGT ATTCCATAGA GCAGCTGACC ACTTACTGGC TGCAGCCAGG 
3420 

GGATGATTTT GAGGAGGCTA TTAGGGTATA TGCAAAGGTG GCACTTAGGC CAGATTGCAA 
3480 

GTACAAGATC AGCAAACTTG TAAATATCAG GAATTGTTGC TACATTTCTG GGAACGGGGC 
3540 

CGAGGTGGAG ATAGATACGG AGGATAGGGT GGCCTTTAGA TGTAGCATGA TAAATATGTG 
3 600 

GCCGGGGGTG CTTGGCATGG ACGGGGTGGT TATTATGAAT GTAAGGTTTA CTGGCCCCAA 
3660 

TTTTAGCGGT ACGGTTTTCC TGGCCAATAC CAACCTTATC CTACACGGTG TAAGCTTCTA 
3720 

TGGGTTTAAC AATACCTGTG TGGAAGCCTG GACCGATGTA AGGGTTCGGG GCTGTGCCTT 
3780 

TTACTGCTGC TGGAAGGGGG TGGTGTGTCG CCCCAAAAGC AGGGCTTCAA TTAAGAAATG 
3840 



WO 98/13499 



-102- 
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CCTCTTTGAA AGGTGTACCT — CT «»« «™C -GCCACAA 
3900 

TGT GGGCTGC GACTGTGGTT GCTGATGGT AGTGAAAAGG GTGGGTGTGA TTAAGGATAA 
3960 

CATGGTATGT GGCAACTGCG AGGACAGGGC ^ CTGACCTGCT CGGAGGGCAA 
4020 

CTGTCACCTG CTGAAGACCA TTCACGTAGC CAGCCACTCT CGCAAGGCCT GGCCAGTGTT 
4080 

TGAGCATAAC ATAC.GACCC GCTGTTCCTT GCATTTGGGT AACAGGAGGG GGGTGTTCCT 
4140 

ACCTTACCAA TGCAATTTGA GTCACACTAA GATATTGCTT GAGGGCGAGA GCATGTCCAA 
4200 

GGTGAACCTG AACGGGGTGT TTGACATGAG GATGAAGATC TGGAAGGTGC TGAGGTACGA 
4260 

TGAGACCCGC ACCAGGTGCA GACCGTGCGA GTGTGGCGGT AAACATATTA GGAAGCAGGC 
4320 

rrrrrKTCAC TTGGTGCTGG CCTGCACCCG 
TGTGATGCTG GATGTGACCG AGGAGCTGAG GCCCGATCAC TTC 

4380 

CGCTGAGTTT GGCTCTAGGG ATGAAGATAC AGATTGAGGT ACTGAAATGT GTGGGCGTGG 
4440 

CTTAAGGGTG GGAAAGAATA TATAAGGTGG GGGTCTTATG TAGTTTTGTA TCTGTTTTGG 
4500 

AGCAGCCGCC GGGGCCATGA GGAGCAAGTG GTTTGATGGA AGCATTGTGA GCTCATATTT 
4560 

GACAAGGCGC ATGCCGCGAT GGGCCGGGGT GGGTCAGAAT GTGATGGGCT GCAGCATTCA 
4620 



WO 98/13499 



-103- 



PCT/EP97/05251 



TGGTCGCCCC GTCCTGCCCG CAAACTCTAC TACCTTGACC TACGAGACCG TGTCTGGAAC 
4680 

GCCGTTGGAG ACTGCAGCCT CCGCCGCCGC TTCAGCCGCT GCAGCCACCG CCCGCGGGAT 
4740 

TGTGACTGAC TTTGCTTTCC TGAGCCCGCT TGCAAGCAGT GCAGCTTCCC GTTCATCCGC 
4800 

CCGCGATGAC AAGTTGACGG CTCTTTTGGC ACAATTGGAT TCTTTGACCC GGGAACTTAA 
4860 

TGTCGTTTCT CAGCAGCTGT TGGATCTGCG CCAGCAGGTT TCTGCCCTGA AGGCTTCCTC 
4920 

CCCTCCCAAT GCGGTTTAAA ACATAAATAA AAAAC C AG AC TCTGTTTGGA TTTGGATCAA 
4980 

GCAAGTGTCT TGCTGTCTCT CGAGGGATCT TTGTGAAGGA ACCTTACTTC TGTGGTGTGA 
5040 

CATAATTGGA CAAACTACCT ACAGAGATTT AAAGCTCTAA GGTAAATATA AAATTTTTAA 
5100 

GTGTATAATG TGTTAAACTA CTGATTCTAA TTGTTTGTGT ATTTTAGATT CCAACCTATG 
5160 

GAACTGATGA ATGGGAGCAG TGGTGGAATG CCTTTAATGA GGAAAACCTG TTTTGCTCAG 
5220 

AAGAAATGCC ATCTAGTGAT GATGAGGCTA CTGCTGACTC TCAACATTCT ACTCCTCCAA 
5280 

AAAAGAAGAG AAAGGTAGAA GACCCCAAGG ACTTTCCTTC AGAATTGCTA AGTTTTTTGA 
5340 



GTCATGCTGT GTTTAGTAAT AGAACTCTTG CTTGCTTTGC TATTTACACC ACAAAGGAAA 
5400 



PCT/EP97/05251 

WO 98/13499 
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mm »mrr kAAAATATTC TGTAACCTTT ATAAGTAGGC 
AAGCTGCACT GCTATACAAG AAAATTATGG AAAAAT ATT C 

5460 

KBMam ™»« — - TTCTTACTCC ACACAGGCAT -AGTGTCTG 
5520 

^ATTAATAA CT,T=CTC« AAATTGTGTA CCTTTAGCTT TTTAATTTGT «»«™ 
5580 

ATAAGGAATA TTTGATGTAT AGTGCCTTGA CTAGAGATCA TAATCAGCCA TACCACATTT 
5640 

,^ sl . rrTr CCACACCTCC CCCTGAACCT GAAACATAAA 
GTAGAGGTTT TACTTGCTTT AAAAAACCTC CCACACCTCU 

5700 

ATGAATGCAA TTGTTGTTGT TAACTTGTTT ATTGCAGCTT ATAATGGTTA CAAATAAAGC 
5760 

AATAGCATCA CAAATTTCAC AAATAAAGCA TTTTTTTCAC TGCATTCTAG TTGTGGTTTG 
5820 

TTATCATGTC TGGATCCGGC TGTGGAATGT GTGTCAGTTA 



TCCAAACTCA TCAATGTATC 
5880 

GGGTGTGGAA AGTGCCCAGG CTCCCCAGCA GGCAGAAGTA TGCAAAGCAT GCATCTCAAT 
5940 

TAGTCAGCAA CCAGGTGTGG AAAGTCGCCA GGCTCCCCAG CAGGCAGAAG TATGCAAAGC 
6000 

ATGCATCTCA ATTAGTCAGC AACCATAGTC CCGCCCCTAA CTCCGCGCAT CGCGGCGCTA 
6060 

ACTCCGCCGA GTTCCGCCCA TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA 



6120 



GAGGCCGAGG CCGCCTCGGC CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA 



6180 
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GGCCTAGGCT TTTGCAAAAA GCTTGGACAC AAGACAGGCT TGCGAGATAT GTTTGAGAAT 
6240 

ACCACTTTAT CCCGCGTCAG GGAGAGGCAG TGCGTAAAAA GACGCGGACT CATGTGAAAT 
6300 

ACTGGTTTTT AGTGCGCCAG ATCTCTATAA TCTCGCGCAA CCTATTTTCC CCTCGAACAC 
6360 

TTTTTAAGCC GTAGATAAAC AGGCTGGGAC ACTTCACATG AG C G AAAAAT ACATCGTCAC 
6420 

CTGGGACATG TTGCAGATCC ATGCACGTAA ACTCGCAAGC CGACTGATGC CTTCTGAACA 
6480 

ATGGAAAGGC ATTATTGCCG TAAGCCGTGG CGGTCTGGTA CCGGGTGCGT TACTGGCGCG 
6540 

TGAACTGGGT ATTCGTCATG TCGATACCGT TTGTATTTCC AGCTACGATC ACGACAACCA 
6600 

GCGCGAGCTT AAAGTGCTGA AACGCGCAGA AGGCGATGGC GAAGGCTTCA TCGTTATTGA 
6660 

TGACCTGGTG GATACCGGTG GTACTGCGGT TGCGATTCGT GAAATGTATC CAAAAGCGCA 
6720 

CTTTGTCACC ATCTTCGCAA AACCGGCTGG TCGTCCGCTG GTTGATGACT ATGTTGTTGA 
6780 

' TATCCCGCAA GATACCTGGA TTGAACAGCC GTGGGATATG GGCGTCGTAT TCGTCCCGCC 
6840 

AATCTCCGGT CGCTAATCTT TTCAACGCCT GGCACTGCCG GGCGTTGTTC TTTTTAACTT 
6900 

CAGGCGGGTT ACAATAGTTT CCAGTAAGTA TTCTGGAGGC TGCATCCATG ACACAGGCAA 
6960 



PCT/EP97/05251 
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ACCTGAGCGA AAGCGTGTTC AAAGGGGGGT TTAAACATGC TGAAAGCTGG ACGCTAGTCC 
7020 

scggctttaa tgacggggga caaggggctg tgcagtcggc gcttgatggt aaaacgatcc 

7080 

CCAGTGGTA TGGCATGATT AACCGTGTGA TGTGGATCTG GCGCGGCATT GAGGGACGGO 
7140 

AAATCCTCGA CGTCCAGGCA CGTATTGTGA TGAGCGATGC CGAACGTACC GACGATGATT 
7200 

TATACGATAC GGTGATTGGC TAGGGTGGCG GCAACTGGAT TTATGAGTGG GCCCCGGATC 
7260 

TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT 
7320 

TAAAGCTCTA AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA 
7380 

TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTGGAAT 



ATTGTTTGTG TATTTTAGAT 
7440 

GCCTTTAATG AGGAAAACCT GTTTTGCTCA GAAGAAATGC CATCTAGTGA TGATGAGGCT 
7500 

ACTGCTGACT CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGAGGGGAAG 
7560 

AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT 



GACTTTCCTT CAGAATTGCT 
7620 



CACAAAGGAA AAAGCTGCAC TGCTATACAA GAAAATTATG 



GCTTGCTTTG CTATTTACAC 
7680 

GAAAAATATT CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA CATACTGTTT 
7740 
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TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA ACTATGCTCA AAAATTGTGT 
7800 

ACCTTTAGCT TTTTAATTTG TAAAGGGGTT AATAAGGAAT ATTTGATGTA TAGTGCCTTG 
7860 

ACTAGAGATC ATAATCAGCC ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT 
7920 

CCCACACCTC CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT 
7980 

TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC 
8040 

ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT 
8100 

CTGGATCCCC AGGAAGCTCC TCTGTGTCCT CATAAACCCT AACCTCCTCT ACTTGAGAGG 
8160 

ACATTCCAAT CATAGGCTGC CCATCCACCC TCTGTGTCCT CCTGTTAATT AGGTCACTTA 
8220 

ACAAAAAGGA AATTGGGTAG GGGTTTTTCA CAGACCGCJT TCTAAGGGTA ATTTTAAAAT 
8280 

ATCTGGGAAG TCCCTTCCAC TGCTGTGTTC CAGAAGTGTT GGTAAACAGC CCACAAATGT 
8340 

CAACAGCAGA AACATACAAG CTGTCAGCTT TGCACAAGGG CCCAACACCC TGCTCATCAA 
8400 

GAAGCACTGT GGTTGCTGTG TTAGTAATGT GCAAAACAGG AGGCACATTT TCCCCACCTG 
8460 

TGTAGGTTCC AAAATATCTA GTGTTTTCAT TTTTACTTGG ATCAGGAACC CAGCACTCCA 
8520 



PCT/EP97/05251 
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***r*r rCTTGTGGTC AGTGTTCATC TGCTGACTGT 
CTGGATAAGC ATTATCCTTA TCCAAAACAG CCTTGTG.T 

8580 

CMCI G T AGC ATTTTTTGGG G^AGAGTTT — 
8640 

W *AX GGTTCCCCAC CAACAGCAAA AAAATGAAAA TTTGACCCTT 
ACACACCCTG CAGCTCCAAA GGTTCCCCAC 

8700 

CM!CKm ^GAGCAGCA T™AG TTTTTTOTGT CCG T GAA T G= — ^ 
8760 

CAGTAACAGC TTCCCACATC AAAATATTTC 



TAGCAGTTAC CCCAATAACC TCAGTTTTAA 
8820 

CC-m* — « — - AAGGAATTCT TGAAGACGAA AGGGCCTCGT 
8 880 

GATACGCCTA TTTTTATAGG TTAATGTCAT GATAATAATG GTTTCTTAGA CGTCAGGTGG 
8940 

CACTTTTCGG GGAAATGTGC GCGGAACCGC TATTTGTTTA TTTTTCTAAA TACATTCAAA 
9000 

T „=G CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA 
9060 

GAGTATGAGT ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT 
9120 

„ iiP^r-TGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG 
TCCTGTTTTT GCTCACCCAG AAACoCTGGT GAAA^i/w^ 

9180 

TGCACGAGTG GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG 
9240 

CCCGGAAGAA CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTG GCGCGGTATT 
9300 



WO 98/13499 PCT/EP97/05251 
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ATCCCGTGTT GACGCCGGGC AAGAGCAACT CGGTCGCCGC ATACACTATT CTCAGAATGA 
9360 

CTTGGTTGAG TACTCACCAG TCACAGAAAA GCATCTTACG GATGGCATGA CAGTAAGAGA 
9420 

ATTATGCAGT GCTGCCATAA CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC 
9480 

GATCGGAGGA CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG 
9540 

CCTTGATCGT TGGGAACCGG AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC 
9600 

GATGCCTGCA GCAATGGCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT 
9660 

AGCTTCCCGG CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT 
9720 

GCGCTCGGCC CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG 
9780 

GTCTCGCGGT ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT 
9840 

CTACACGACG GGGAGTCAGG CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG 
9900 

TGCCTCACTG ATTAAGCATT GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT 
9960 

TGATTTAAAA CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT 
10020 

CATGACCAAA ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA 
10080 



PCT/EP97/05251 

WO 98/13499 
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^mnnrrm ATCTGCTGCT TGCAAACAAA 
GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA 

10140 

^ C „CCG CCC-C- ™-™m =™ CTCTTTTTCC 

10200 

GAAGGTAACT GGC^AGCA GAGCCCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA 
10260 

mrrr * rrGC CTACA TACCTCGCTC TGCTAATCCT 
GTTAGGCCAC CACTTCAAGA ACTCTGTAGC ACCGCCTACA TA 

10320 

GTCGTGTCTT ACCGGGTTGG ACTCAAGACG 



TTACCAGTG GCTGCTGCCA GTGGCGATAA 

AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CACAGCCCAG 



G' 

10380 



ATAGTTACCG GATAAGGCGC 
10440 



CTTGGAGCGA ACGACCTACA CCGAACTGAG ATACCTACAG CGTGAGCTAT GAGAAAGCGC 
10500 

CACGCTTCCC GAAGGGAGAA AGGCGGACAG GTATCCGGTA AGCGGCAGGG TCGGAACAGG 
10560 

rrrCTGGTAT CTTTATAGTC CTGTCGGGTT 
AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT C 

10620 

T CGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG 
10680 

GAAAAACCCC AGCAACGCGG CCT^ACG GTTCCTGGCC TTTTGCTGGC C^GGTCA 
10740 

CATGTTCTTT CCTGCGTTAT CCCCTGArTC TGTGGATAAC CGTATTACCG CCTTTGAGTG 
10800 

AGCTGATACC GCTCGCCGCA GCCGAACGAC CGAGCGCAGC GAGTCAGTGA GCGAGGAAGC 
10860 
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GGAAGAGCGC CTGATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT CACACCGCAT 
10920 

ATGGTGCACT CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGT ATACACTCCG 
10980 

CTATCGCTAC GTGACTGGGT CATGGCTGCG CCCCGACACC CGCCAACACC CGCTGACGCG 
11040 

CCCTGACGGG CTTGTCTGCT CCCGGCATCC GCTTACAGAC AAGCTGTGAC CGTCTCCGGG 
11100 

AGCTGCATGT GTCAGAGGTT TTCACCGTCA TCACCGAAAC GCGCGAGGCA GC 
11152 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGGATCGG GAGATCTCC 
19 

(2) INFORMATION FOR SEQ ID NO: 14: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

CCGCCTCAGA AGCCATAGAG CC 
22 

(2 ) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14455 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii ) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi , SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AAGCTTGGGC AGAAATGGTT GAACTCCCGA GAGTGTCCTA CACCTAGGGG AGAAGCAGCC 



60 



ftAGGGGTTGT TTCCC^ «CCC TCTGCGCACA ™< 



120 
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AAAGACATAT TCATTCTCTG CTGCAAACTT 
180 

GGGGAAGTTG CGGTTCGTGC TCGCAGGGCT 
240 

GTGCAAGATT ACAATCTAAA CAATTCGGAG 
300 

CAGCCAACTT CCTCTTACAA GCCGCATCGA 
360 

CTTGCTAAAA ATTATATTTT TACCAATAAG 
420 

ATGTTAAGAA ATGAATCATT ATCTTTTAGT 
480 

ATGGGAATAG AAAATAGAAA GAGACGCTCA 
540 

TTGACCACAG GCCTAGAAGT AAAAAAGGGA 
600 

AGGTGGTGGC AACCAGGGAC TTATAGGGGA 
660 

TTACCATATA CAGGAAGATA TGACTTAAAT 
720 

AAAGTGTTAT ATAGATCCCT CCCTTTTCGT 
780 

TGTATGTTGT CTCAAGAAGA AAAAGACGAC 
840 

CTAGGAACAG GAATGCACTT TTGGGGAAAG 
900 



GGCATAGCTC TGCTTTGCCT GGGGCTATTG 

CTCACCCTTG ACTCTTTTAA TAGCTCTTCT 

AACTCGACCT TCCTCCTGAG GCAAGGACCA 

TTTTGTCCTT CAGAAATAGA AATAAGAATG 

ACCAATCCAA TAGGTAGATT ATTAGTTACT 

ACTATTTTTA CTCAAATTCA GAAGTTAGAA 
ACCTCAATTG AAGAACAGGT GCAAGGACTA 

AAAAAGAGTG TTTTTGTCAA AATAGGAGAC 

CCTTACATCT ACAGACCAAC AGATGCCCCC 

TGGGATAGGT GGGTTACAGT CAATGGCTAT 

GAAAGACTCG CCAGAGCTAG ACCTCCTTGG 

ATGAAACAAC AGGTACATGA TTATATTTAT 

ATTTTCCATA CCAAGGAGGG GACAGTGGCT 
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aaCOKaB AACAT.ATTG ™ TCS ATAGCCTTTA 

960 

TT GGGCGAAG CTTGCGGTTC CCAGGGCTTA AGTAAGTTTT TGGTTACAAA CTGTTCTTAA 
1020 

AACGAGGATG TGAGACAAGT GGTTTCCTGA G^GTTTGG TATCAAAGGT TCTGATGTGA 
1080 

GCTCTGAGTG TTCTATTTTC CTATGTTCTT TTGGAATTTA TCCAAATCTT ATGTAAATGC 
1140 

TTATGTAAAC CAAGATATAA AAGAGTGCTG ATTTTTTGAG TAAACTTGCA ACAGTCCTAA 
1200 

CATTCACCTC TTGTGTGTTT GTGTCTGTTC GCCATCCCGT CTCCGCTCGT CACTTATCCT 
1260 

TCACTTTCCA GAGGGTCCGG GCGCAGACCC CGGCGACCCT CAGGTCGGCC GACTGCGGCA 
1320 

GCTGGCGCCC GAACAGGGAC CCTCGGATAA GTGAGGCTTG TCTCTATTTC TACTATTTGG 
1380 

TGTTTGTCTT GTATTGTCTC TTTCTTGTCT GGCTATCATC ACAAGAGGGG AACGGAGTGA 
1440 

CCATAGGGAC CAAGCTAGCG ACTGAAAATG AGACATATTA TCTGCCACGG AGGTGTTATT 
1500 

ACCGAAGAAA TGGCCGCCAG TCTTTTGGAC CAGCTGATCG AAGAGGTACT GGCTGATAAT 
1560 

CTTCCAGGTC CTAGCCATTT TGAACCACCT ACCCTTCACG AACTGTATGA TTTAGACGTG 
1620 

.CGGCCGCCG AAGATCGCAA CGAGGAGGCG GTTTCGCAGA TTTTTCCCGA CTCTGTAATG 



1680 
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TTGGCGGTGC AGGAAGGGAT TGACTTACTG ACTTTTCCGC CGGCGCCCGG TTCTCCGGAG 
1740 

CCGCCTCACC TTTCCCGGCA GCCCGAGCAG CCGGAGCAGA GAGCCTTGGG TCCGGTTTCT 
1800 

ATGCCAAACC TTGTACCGGA GGTGATCGAT CTTACCTGCC ACGAGGCTGG CTTTCCACCC 
1860 

AGTGACGACG AGGATGAAGA GGGTGAGGAG TTTGTGTTAG ATTATGTGGA GCACCCCGGG 
1920 

CACGGTTGCA GGTCTTGTCA TTATCACCGG AGGAATACGG GGGACCCAGA TATTATGTGT 
1980 

TCGCTTTGCT ATATGAGGAC CTGTGGCATG TTTGTCTACA GTAAGTGAAA ATTATGGGCA 
2040 

GTGGGTGATA GAGTGGTGGG TTTGGTGTGG TAATTTTTTT TTTAATTTTT ACAGTTTTGT 
2100 

GGTTTAAAGA ATTTTGTATT GTGATTTTTT TAAAAGGTCC TGTGTCTGAA CCTGAGCCTG 
2160 

AGCCCGAGCC AGAACCGGAG CCTGCAAGAC CTACCCGCCG TCCTAAAATG GCGCCTGCTA 
2220 

TCCTGAGACG CCCGACATCA CCTGTGTCTA GAGAATGCAA TAGTAGTACG GATAGCTGTG 
2280 

ACTCCGGTCC TTCTAACACA CCTCCTGAGA TACACCCGGT GGTCCCGCTG TGCCCCATTA 
2340 

AACCAGTTGC CGTGAGAGTT GGTGGGCGTC GCCAGGCTGT GGAATGTATC GAGGACTTGC 
2400 

TTAACGAGCC TGGGCAACCT TTGGACTTGA GCTGTAAACG CCCCAGGCCA TAAGGTGTAA 
2460 



* « 
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n tt A aCGCCTT TGTTTGCTGA ATGAGTTGAT GTAAGTTTAA 
ACCTGTGATT GCGTGTGTGG TTAACGCCTT 

2520 

»mo rrrTrTTAAA TGGGGCGGGG CTTAAAGGGT 
TAAAGGGTGA GATAATGTTT AACTTGCATG GCGTGTTAAA 

2580 

„GCG CCGTGGGCTA ATCTTGGTTA CATCGACC CATGGAGGCT TGGGAGTGTT 
2640 

TGGAAGATTT TTCTGCTGTG CGTAACTTGC TGGAACAGAG CTCTAACAGT AC™ 
2700 

TTTGGAGGTT TCTGTGGGGC TCATCCCAGG CAAAGTTAGT CTGCAGAATT AAGGAGGATT 

Hgtggga a^gaagag gt^gaaax c^tggtga gctgtttgat ^gaa.g 

2820 

t oggtgacca ggcgcttttc caagagaagg tcatgaagag tttggatttt tccagaccgg 

2880 

gocgcgctgg ggctggtgt, gcttttttga gttttataaa ggataaatg. agcgaagaaa 

2940 

CCCATCTGAG GGGGGGGTAG CTGGTGGATT TTCTGGCGAT GGATCTGTG. AGAGCGGTTG 
3000 

TGAGACACAA GAATCGCCTG CTACTGTTGT CTTCCGTCCG CCCGGCGATA ATACCGACGG 
3060 

AGGAGCAGCA GCAGCAGGAG GAGGAAGCCA GGCGGCGGCG GGAGGAGGAG AGCCCATGGA 
3120 

ACCCGAGAGC GGGCGTGGAG CCTCGGGAAT GAATGTTGTA CAGGTGGCTG AACTGTATCC 
3180 

AGAACTGAGA CGCATTTTGA CAATTACAGA GGATGGGCAG GGGCTAAAGG GGGTAAAGAG 
3240 
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GGAGCGGGGG GCTTGTGAGG CTACAGAGGA GGCTAGGAAT CTAGCTTTTA GCTTAATGAC 
3300 

CAGACACCGT CCTGAGTGTA TTACTTTTCA ACAGATCAAG GATAATTGCG CTAATGAGCT 
3360 

TGATCTGCTG GCGC AG AAGT ATTCCATAGA GCAGCTGACC ACTTACTGGC TGCAGCCAGG 
3420 

GGATGATTTT GAGGAGGCTA TTAGGGTATA TGCAAAGGTG GCACTTAGGC CAGATTGCAA 
3480 

GTACAAGATC AGCAAACTTG T AAAT AT C AG GAATTGTTGC TACATTTCTG GGAACGGGGC 
3540 

CGAGGTGGAG ATAGATACGG AGGATAGGGT GGCCTTTAGA TGTAGCATGA TAAATATGTG 
3600 

GCCGGGGGTG CTTGGCATGG ACGGGGTGGT TATTATGAAT GTAAGGTTTA CTGGCCCCAA 
3660 

TTTTAGCGGT ACGGTTTTCC TGGCCAATAC CAACCTTATC CTACACGGTG TAAGCTTCTA 
3720 

TGGGTTTAAC AATACCTGTG TGGAAGCCTG GACCGATGTA AGGGTTCGGG GCTGTGCCTT 
3780 

TTACTGCTGC TGGAAGGGGG TGGTGTGTCG CCCCAAAAGC AGGGCTTCAA TTAAGAAATG 
3840 

CCTCTTTGAA AGGTGTACCT TGGGTATCCT GTCTGAGGGT AACTCCAGGG TGCGCCACAA 
3900 

TGTGGCCTCC GACTGTGGTT GCTTCATGCT AGTGAAAAGC GTGGCTGTGA TTAAGCATAA 
3960 

CATGGTATGT GGCAACTGCG AGGACAGGGC CTCTCAGATG CTGACCTGCT CGGACGGCAA 
4020 



PCT7EP97/05251 

WO 98/13499 

-118- 

„,^r*CCA "TCACGTAGC CAGCCACTCT CGCAAGGCCT GGCCAGTGTT 
CTGTCACCTG CTGAAGACCA .TCA<-Ui 

4080 

TGAGCATAAC ATAGTGAGGC GCTGTTCCTT GGA™ AAGAGGAGGG CT 
4140 

t CGTTAGCAA TGGAATTTCA GTCACACTAA GATATTGCTT GAGCCCGAGA GCATGTCCAA 
4200 

GGTGAACCTG AACGGGGTGT TTGACATGAC CATGAAGATC TGGAAGGTGC TGAGGTACGA 
4260 

TGAGACCCGC ACCAGGTGCA GAGGGTGCGA GTGTGGCGGT AAACATATTA GGAAGCAGGC 
4320 

TGTGATGCTG GATGTGACCG AGGAGCTGAG GGCGGATGAG TTGGTGCTGG CCTGCACCCG 
4380 

CGCTGAGTTT GGCTCTAGCG ATGAAGATAC AGATTGAGGT ACTGAAATGT GTGGGCGTGG 
4440 

CTTAAGGGTG GGAAAGAATA TATAAGGTGG GGGTCTTATG TAGTTTTGTA TCTGTT^C 
4500 

„GCG GCCGCCATGA GCACGAACTC GTTTGATGGA AGCATTGTGA GCTCATATTT 
4560 

GACAACGCGC ATGCCCCCAT GGGCCGGGGT GCGTCAGAAT GTGATGGGCT CCAGCATTGA 
4620 

TGGTCGCC CC GTCCTGCCCG GAAAGTGTAG TAGCTTGAGG TACGAGACCG TGTCTGGAAC 
4680 

GCCGTTGGAG ACTGCAGCCT CCGCCGCCGC TTGAGGGGGT GGAGGGAGGG CCCGCGGGAT 
4740 

TGTGACTGAC TTTGCTTTCC TGAGCCCGCT TGCAAGCAGT GCAGCTTCCC GTTCATCCGC 
4800 
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CCGCGATGAC AAGTTGACGG CTCTTTTGGC ACAATTGGAT TCTTTGACCC GGGAACTTAA 
4860 

TGTCGTTTCT CAGCAGCTGT TGGATCTGCG CCAGCAGGTT TCTGCCCTGA AGGCTTCCTC 
4920 

CCCTCCCAAT GCGGTTTAAA ACATAAATAA AAAACCAGAC TCTGTTTGGA TTTGGATCAA 
4980 

GCAAGTGTCT TGCTGTCTCT CGAGGGATCT TTGTGAAGGA ACCTTACTTC TGTGGTGTGA 
5040 

CATAATTGGA CAAACTACCT ACAGAGATTT AAAGCTCTAA GGTAAATATA AAATTTTTAA 
5100 

GTGTATAATG TGTTAAACTA CTGATTCTAA TTGTTTGTGT ATTTTAGATT CCAACCTATG 
5160 

GAACTGATGA ATGGGAGCAG TGGTGGAATG CCTTTAATGA GGAAAACCTG TTTTGCTCAG 
5220 

AAGAAATGCC ATCTAGTGAT GATGAGGCTA CTGCTGACTC TCAACATTCT ACTCCTCCAA 
5280 

AAAAGAAGAG AAAGGTAGAA GACCCCAAGG ACTTTCCTTC AGAATTGCTA AGTTTTTTGA 
5340 

GTCATGCTGT GTTTAGTAAT AGAACTCTTG CTTGCTTTGC TATTTACACC ACAAAGGAAA 
5400 

AAGCTGCACT GCTATACAAG AAAATTATGG AAAAAT ATT C TGTAACCTTT ATAAGTAGGC 
5460 

ATAACAGTTA TAATCATAAC ATACTGTTTT TTCTTACTCC ACACAGGCAT AGAGTGTCTG 
5520 

CTATTAATAA CTATGCTCAA AAATTGTGTA CCTTTAGCTT TTTAATTTGT AAAGGGGTTA 
5580 



PCT/EP97/05251 

WO 98/13499 

-120- 

ATAAGGAATA TTTGATGTAT AGTGCCTTGA CTAGAGATCA TAATCAGCCA TACCACATTT 
5640 

GTAGAGGTTT TACTTGCTTT AAAAAACCTC CCACACCTCC CCCTGAACCT GAAACATAAA 
5700 

.mmrrirrn ATAATGGTTA CAAATAAAGC 
ATGAATGCAA TTGTTGTTGT TAACTTGTTT ATTGCAGCTT ATAAT 

5760 

aatagcatca caaatttcac aaataaagga TOTT tgac TGCATTCTAG TTGTGGTTTG 

5820 

TCCAAACTCA TCAATGTATC TTATCATGTC TGGATCCGGC TGTGGAATGT GTGTGAGTTA 
5880 

GGGTGTGGAA AGTGGGGAGG GTGGGGAGGA GGGAGAAGTA TGCAAAGCAT GCATCTCAAT 
5940 

TAGTCAGCAA CCAGGTGTGG AAAGTCCCCA GGCTCCCCAG GAGGGAGAAG TATGCAAAGC 
6000 

ATGCATCTCA ATTAGTCAGC AACCATAGTC CCGCCCCTAA CTGCGGGGAT CCCGCCCCTA 
6060 

ACTCCGCCCA GTTCCGCCCA TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA 
6120 

GAGGCCGAGG GCGCCTCGGC CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA 
6180 

GGCCTAGGCT TTTGCAAAAA GCTTGGACAC AAGACAGGCT TGCGAGATAT GTTTGAGAAT 
6240 

AGCACTTTAT CCCGCGTCAG GGAGAGGCAG TGCGTAAAAA GACGCGGACT CATGTGAAAT 
6300 

ACTGGTTTTT AGTGCGCCAG ATGTGTATAA TCTCGCGCAA CCTATTTTCC CCTCGAACAC 
6360 
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TTTTTAAGCC GTAGATAAAC AGGCTGGGAC ACTTCACATG AGCGAAAAAT ACATCGTCAC 
6420 

CTGGGACATG TTGCAGATCC ATGCACGTAA ACTCGCAAGC CGACTGATGC CTTCTGAACA 
6480 

ATGGAAAGGC ATTATTGCCG TAAGCCGTGG CGGTCTGGTA CCGGGTGCGT TACTGGCGCG 
6540 

TGAACTGGGT ATTCGTCATG TCGATACCGT TTGTATTTCC AGCTACGATC ACGACAACCA 
6600 

GCGCGAGCTT AAAGTGCTGA AACGCGCAGA AGGCGATGGC GAAGGCTTCA TCGTTATTGA 
6660 

TGACCTGGTG GATACCGGTG GTACTGCGGT TGCGATTCGT GAAATGTATC CAAAAGCGCA 
6720 

CTTTGTCACC ATCTTCGCAA AACCGGCTGG TCGTCCGCTG GTTGATGACT ATGTTGTTGA 
6780 

TATCCCGCAA GATACCTGGA TTGAACAGCC GTGGGATATG GGCGTCGTAT TCGTCCCGCC 
6840 

AATCTCCGGT CGCTAATCTT TTCAACGCCT GGCACTGCCG GGCGTTGTTC TTTTTAACTT 
6900' 

CAGGCGGGTT ACAATAGTTT CCAGTAAGTA TTCTGGAGGC TGCATCCATG ACACAGGCAA 
6960 

ACCTGAGCGA AACCCTGTTC AAACCCCGCT TTAAACATCC TGAAACCTCG ACGCTAGTCC 
7020 

GCCGCTTTAA TCACGGCGCA CAACCGCCTG TGCAGTCGGC CCTTGATGGT AAAACCATCC 
7080 

CTCACTGGTA TCGCATGATT AACCGTCTGA TGTGGATCTG GCGCGGCATT GACCCACGCG 
7140 
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WO 98/13499 

-122- 

C G T CCAGGCA CGTATTGTGA TGAGCGA.GC « GACGATGATT 

7200 

TATACGATAC GGTGATTGGC TACCGTGGCG GCAACTGGAT TTATGAGTGG GCCCCGGATC 
7260 

TTTGTGAAGG AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT 
7320 

TAAAGCTCTA AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA 

ATTGTTTGTG TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTCGAAT 
7440 

irru^ACCT GTTTTGCTCA GAAGAAATGC CATCTAGTGA TGATGAGGCT 
GCCTTTAATG AGGAAAACCT G1li^v.Au 

7500 

ACTGCTGACT CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGACCCCAAG 
7560 

GACTTTCCTT CAGAATTGCT AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT 
7620 

„^** ^arCTGCAC TGCTATACAA GAAAATTATG 
GCTTGCTTTG CTATTTACAC CACAAAGGAA AAAGCTGCAC 

7680 

GAAAAATATT CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA CATACTGTTT 
7740 

TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA ACTATGCTCA AAAATTGTGT 
7800 

•Ccm-CT TTTTAATTTG TAAAGGGGTT AATAAGGAAT ATTTGATGTA TAGTGCCTTG 
7860 

ACTAGAGATC ATAATCAGCC ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT 
7920 
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CCCACACCTC CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT 
7980 

TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC 
8040 

ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT 
8100 

CTGGATCCCC AGGAAGCTCC TCTGTGTCCT CATAAACCCT AACCTCCTCT ACTTGAGAGG 
8160 

ACATTCCAAT CATAGGCTGC CCATCCACCC TCTGTGTCCT CCTGTTAATT AGGTCACTTA 
8220 

ACAAAAAGGA AATTGGGTAG GGGTTTTTCA CAGACCGCTT TCTAAGGGTA ATTTTAAAAT 
8280 

ATCTGGGAAG TCCCTTCCAC TGCTGTGTTC CAGAAGTGTT GGTAAACAGC CCACAAATGT 
8340 

CAACAGCAGA AACATACAAG CTGTCAGCTT TGCACAAGGG CCCAACACCC TGCTCATCAA 
8400 

GAAGCACTGT GGTTGCTGTG TTAGTAATGT GCAAAACAGG AGGCACATTT TCCCCACCTG 
8460 

TGTAGGTTCC AAAATATCTA GTGTTTTCAT TTTTACTTGG ATCAGGAACC CAGCACTCCA 
8520 

CTGGATAAGC ATTATCCTTA TCCAAAACAG CCTTGTGGTC AGTGTTCATC TGCTGACTGT 
8580 

CAACTGTAGC ATTTTTTGGG GTTACAGTTT GAGCAGGATA TTTGGTCCTG TAGTTTGCTA 
8640 



ACACACCCTG' CAGCTCCAAA GGTTCCCCAC CAACAGCAAA AAAATGAAAA TTTGACCCTT 
8700 
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GAATGGGTTT TCCAGCACCA TTTTCATGAG TTTTTTGTGT CCCTGAATGC AAGTTTAACA 
8760 

TAGCAGTTAC CCCAATAACC TCAGTTTTAA CAGTAACAGC TTCCCACATC AAAATATTTC 
8820 

CACAGGTTAA GTCCTCATTT AAATTAGGCA AAGGAATTCT TGAAGACGAA AGGGCCTCGT 
8880 

„ aTAGG TTAATGTCAT GATAATAATG GTTTCTTAGA CGTCAGGTGG 
GATACGCCTA TTTTTATAGG TTAAibiun 

8940 

CACTTTTCGG GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA TAGATTCAAA 
9000 

TATGTATCCG CTCATGAGAC AATAACCCTG ATAAATGCTT CAATAATATT GAAAAAGGAA 
9060 

GAGTATGAGT ATTCAACATT TCCGTGTCGC CCTTATTCCC TTTTTTGCGG CATTTTGCCT 
9120 

TCCTGTTTTT GCTCAGCGAG AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG 
9180 

TGCACGAGTG GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG 
9240 

mp . >fl , r . rr!ir TTTTAAAGTT CTGCTATGTG GCGCGGTATT 
CCCCGAAGAA CGTTTTCCAA TGATGAGCAC TTTTAAAGu. 1 v. 

9300 

ATCCCGTGTT GACGCCGGGC AAGAGCAACT CGGTCGGCGC ATACACTATT CTCAGAATGA 
9360 

CTTGGTTGAG TACTCACCAG TGACAGAAAA GGATCTTACG GATGGCATGA CAGTAAGAGA 
9420 

ATTATGCAGT GCTGCCATAA CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC 
9480 
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GATCGGAGGA CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG 
9540 

CCTTGATCGT TGGGAACCGG AGCTGAATGA AGCCATACCA AACGACGAGC GTGACACCAC 
9600 

GATGCCTGCA GCAATGGCAA CAACGTTGCG CAAACTATTA ACTGGCGAAC TACTTACTCT 
9660 

AGCTTCCCGG CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT 
9720 

GCGCTCGGCC CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG 
9780 

GTCTCGCGGT ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT 
9840 

CTACACGACG GGGAGTCAGG CAACTATGGA TGAACGAAAT AGACAGATCG CTGAGATAGG 
9900 

TGCCTCACTG ATTAAGCATT GGTAACTGTC AGACCAAGTT TACTCATATA TACTTTAGAT 
9960 

TGATTTAAAA CTTCATTTTT AATTTAAAAG GATCTAGGTG AAGATCCTTT TTGATAATCT 
10020 

CATGACCAAA ATCCCTTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA 
10080 

GATCAAAGGA . TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA 
10140 

AAAACCACCG CTACCAGCG.G TGGTTTGTTT GCCGGATCAA GAGCTACCAA CTCTTTTTCC 
10200 



GAAGGTAACT GGCTTCAGCA GAGCGCAGAT ACCAAATACT GTCCTTCTAG TGTAGCCGTA 
10260 
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m « a c -GTGTAGC ACCGCGTACA TACCTCGCTC TGCTAATCC 

10320 

GTTACCAGTG GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG 
10380 

ATAGTTACCG GATAAGGCGC AGCGGTCGGG CTGAACGGGG GGTTCGTGCA CAGAGCGGAG 
10440 

CTTGGAGCGA ACGACCTACA CGGAACTGAG ATACCTACAG CGTGAGCTAT GAGAAAGGGC 
10500 

G.CGCTTCGG GAAGGGAGAA AGGGGGAGAG GTATCCGGTA AGGGGGAGGG TCGGAACAGG 
10560 

^nnm rGCCTGGTAT CTTTATAGTC CTGTCGGGTT 
AGAGCGCACG AGGGAGCTTC CAGGGGGAAA CGCCTGGTAT C 

10620 

TCGCCACCTC TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG 
10680 

GAAAAACGCC AGCAAGGGGG CCTTTTTACG ™=GG TTTTGCTGGC CTTTTGCTCA 
10740 

CATGTTCTTT CCTGCGTTAT CCCCTGATTC TGTGGATAAC CGTATTACCG CCTTTGAGTG 
10800 

AGCTGATACC GCTGGGGGCA GCCGAACGAC GGAGGGCAGG GAGTCAGTGA GGGAGGAAGG 
10860 

rTP ATGCGGT ATTTTCTCCT TACGCATCTG TGCGGTATTT CACACCGCAT 
GGAAGAGCGC CTGATGCGGT ATinti^ 

10920 

ACCGCCTCAG AAGCCATAGA GCCCACCGCA TCCCCAGCAT GCCTGCTATT GTCTTCCCAA 
10980 

TCCTCCCCCT TGCTGTCCTG CCCCACCCCA CCGCGGAGAA TAGAATGACA CCTACTCAGA 
11040 
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CAATGCGATG CAATTTCCTC ATTTTATTAG GAAAGGACAG TGGGAGTGGC ACCTTCCAGG 
11100 

GTCAAGGAAG GCACGGGGGA GGGGCAAACA ACAGATGGCT GGCAACTAGA AGGCACAGTC 
11160 

GAGGCTGATC AGCGAGCTCT AGCATTTAGG TGACACTATA GAATAGGGCC CTCTAGATGC 
11220 

ATGCTCGAGC GGCCGCTTCT TTATTCTTGG GCAATGTATG AAAAAGTGTA AGAGGATGTG 
11280 

GCAAATATTT CATTAATGTA GTTGTGGCCA GACCAGTCCC ATGAAAATGA CATAGAGTAT 
11340 

GCACTTGGAG TTGTGTCTCC TGTTTCCTGT GTACCGTTTA GTGTAATGGT TAGTGTTACA 
11400 

GGTTTAGTTT TGTCTCCGTT TAAGTAAACT TGACTGACAA TGTTACTTTT GGCAGTTTTA 
11460 

CCGTGAGATT TTGGATAAGC TGATAGGTTA GGCATAAATC CAACAGCGTT TGTATAGGCT 
11520 

GTGCCTTCAG TAAGATCTCC ATTTCTAAAG TTCCAATATT CTGGGTCCAG GAAGGAATTG 
11580 

TTTAGTAGCA CTCCATTTTC GTCAAATCTT ATAATAAGAT GAGCACTTTG AACTGTTCCA 
11640 

GATATTGGAG CCAAACTGCC TTTAACAGCC AAAACTGAAA CTGTAGCAAG TATTTGACTG 
11700 

CCACATTTTG TTAAGACCAA AGTGAGTTTA GCATCTTTCT CTGCATTTAG TCTACAGTTA 
11760 



GGAGATGGAG CTGGTGTGGT CCACAAAGTT AGCTTATCAT TATTTTTGTT TCCTACTGTA 
11820 
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„G TGCTGTCAAA ***** — ™»«~ 
11880 

- _g GCCAATTTTT GTTTTGAGGG GATTTGTGTT TGGTGCATTA 
GAATCAAATT CTAGGCCATG GCCAATT1 1 

11940 

G5T GAACCAA A^CAAGCCC *«» — ™ TGGCTGTAGC GTCAAACATC 
12000 

AACCCCTTGG ^ G™ACCTCA AGCTTTTTGG ^ AGCGTAAAC 
12060 

gtTAATATCC AAGTTGTGGG CTGAGTTTAT AAAAAGAGGG 
AAGTAAAGGC CTTTGTTGTA GTTAATATCL « 

12120 

CCCTGTCCTA GTCTTAGATT .A^ TGAGCATCAA ACGGATAACT AACATCAAGT 
12180 

«™ arTPCTCCTG CTACATTAAG TTGCATATTG 
ATAAGGCGTC TGTTTTGAGA ATCAATCCTT AGTCCTCCTG 

12240 

rpm GGCTCCAGTA ACTTTAGTTT GCAAGGAAGT ATTATTAATA 
CCTTGTGAAT CAAAACCCAA GGCTCCAG1A a 

12300 

„ n „ rll . rTGTTTAGGT CGTCTGTTAC ATGCAAAGGA 
GTCACACCTG GACCAGTTGC TACGGTCAAA GTGTTTAGGT 

12360 

SCGCCGTAC, TTAGTCCTAG TTTTCCATTT TGTGTATAAA TGGGCTCTTT CAAGTCAATG 
12420 

CCCAAGCTAC CAGTGGCAGT AGTTAGAGGG CGTGAGGCAG TGATAGTAAG GGTACTGCTA 
12480 

TrtrGGGGCC TGATGTTTGC AGGGCTAGCT TTCCTTCTGA CACTGTGAGG 
TCGGTGGTGG TGAGGGGGCC 

12540 

GGTCCTTGGG TGGCAATGCT AAGTTTGGAG TCGTGCACGG TTAGCGGGGC CTGTGATTGC 
12600 
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ATGGTGAGTG TGTTGCCCGC GACCATTAGA GGTGCGGCGG CAGCCACAGT TAGGGCTTCT 
12660 

GAGGTAACTG TGAGGGGTGC AGATATTTCC AGGTTTATGT TTGACTTGGT TTTTTTGAGA 
12720 

GGTGGGCTCA CAGTGGTTAC ATTTTGGGAG GTAAGGTTGC CGGCCTCGTC CAGAGAGAGG 
12780 

CCGTTGCCCA TTTTGAGCGC AAGCATGCCA TTGGAGGTAA CTAGAGGTTC GGATAGGCGC 
12840 

AAAGAGAGTA CCCCAGGGGG ACTCTCTTGA AACCCATTGG GGGATACAAA GGGAGGAGTA 
12900 

AGAAAAGGCA CAGTTGGAGG ACCGGTTTCC GTGTCATATG GATACACGGG GTTGAAGGTA 
12960 

TCTTCAGACG GTCTTGCGCG CTTCATCTTG GATCTCAAGC CTGCCACACC TCACCTCGAC 
13020 

CATCCGCCGT CTCAAGACCG CGTACTTTAA TTACATCATC AGCAGCACCT CCGCCAGAAA 
13080 

CAACCCCGAC CGCCACCCGC TGCCGCCCGC CACGGTGCTC AGCCTACCTT GCGACTGTGA 
13140 

CTGGTTAGAC GCCTTTCTCG AGAGGTTTTC CGATCCGGTC GATGCGGACT CGCTCAGGTC 
13200 

CCTCGGTGGC GGAGTACCGT TCGGAGGCCG ACGGGTTTCC GATCCAAGAG TACTGGAAAG 
13260 

ACCGCGAAGA GTTTGTCCTC AACCGCGAGC CCAACAGCGA GCTCGAATTC AGATCCGAGC 
13320 



TCGGTACCAA GCTTGGGTCT CCCTATAGTG AGTCGTATTA ATTTCGATAA GCCAGTAAGC 
13380 
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^cnr urarAGCTCT GCTTATATAG ACCTCCCACC GTACACGCCT 
AGTGGGTTCT CTAGTTAGCC AGAGAGCTC1 

13440 

^.-p-. PTTACGACAT TTTGGAAAGT CCCGTTGATT 
ACCGCCCATT TGCGTCAATG GGGCGGAGTT GTTACGACAT 

13500 



TTGGTGCCAA AAGAAACTCC CATTGAGGTC AA.GGGO^G AGACTTGGAA ATCCCCGTGA 
13560 

GTCAAACCGC TATCCACGCC CATTGATGTA CTGCGAAAAC CGCATCACCA TGGTAATAGC 
13620 

GATGACTAAT ACGTAGATGT ACTGCCAAGT AGGAAAGTCC CATAAGGTCA TGTACTGGGC 
13680 

ATAATGCCAG GCGGGCCATT TACCGTCATT GACGTCAATA GGGGGCGTAC TTGGCATATG 
13740 

ATACACTTGA TGTACTGCCA AGTGGGCAGT TTACCGTAAA TAGTCGAGCG ATTGACGTCA 
13800 

ATGGAAAGTC CCTATTGGCG TTACTATGGG AACATACGTC ATTATTGACG TCAATGGGCG 
13860 

GGGGTCGTTG GGCGGTCAGC CAGGCGGGCG ATTTACCGTA AGTTATGTAA CGCGGAACTC 
13920 

CATATATGGG CTATGAACTA ATGACCCCGT AATTGATTAC TATTAATAAC TAGTCAATAA 
13980 

™*mn <vrrrccG?AC ATCGCGAAGC AGCGCAAAAC GCCTAACCCT 
TCAATGTCAA CGCGTATATC TGGCCCGTAC Ai^^ 

14040 

AAGCAGATTC TTCATGCAAT TGTCGGTCAA GCCTTGCCTT GTTGTAGCTT AAATTTTGCT 
14100 

CGGCCACTAC TCAGCGACCT CCAAGACACA AGCAGGGAGC AGATACTGGC TTAACTATGC 



14160 
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GGCATCAGAG CAGATTGTAC TGAGAGTCGA CCATAGGGGA TCGGGAGATC TCCCGATCCG 
14220 

TCTATGGTGC ACTCTCAGTA CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGTATACACT 
14280 

CCGCTATCGC TACGTGACTG GGTCATGGCT GCGCCCCGAC ACCCGCCAAC ACCCGCTGAC 
14340 

GCGCCCTGAC GGGCTTGTCT GCTCCCGGCA TCCGCTTACA GACAAGCTGT GACCGTCT.CC 
14400 

GGGAGCTGCA TGTGTCAGAG GTTTTCACCG TCATCACCGA AACGCGCGAG GCAGC 
14455 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GACGGATCGG GAGATCCGCG CGGTACACAG AATTCAGGAG ACACAACTCC AAGTGCATAC 
60 

TCTATGTCAT TTTCATGGGA CTGGTCTGGC CACAACTACA TTAATGAAAT ATTTGCCACA 
120 
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, „, - i ^i-.r\ GAATAAAGAA TCGTTTGTGT TATGTTTCAA 

TCCTCTTACA CTTTTTCATA 



CATTGCCCAA 

XCTTAUA 

180 



™r fcfcGTCATTTT TCATTCAGTA GTATAGCCCC 
CGTGTTTATT TTTCAATTGC AGAAAATTTC AAGTCATTT 

240 

^CA GATCACCGTA CCTTAATCAA ACTCACAGAA CCCTAGTATT 
ACCACCACAT AGCTTATACA GATCACCGT 

300 

C . CCT GGGA ««« — « G^CCCG GGCTGGCCTT 

360 

. TtACAGACAT ATTCTTAGGT GTTATATTCC ACACGGTTTC 
AAAAAGCATC ATATCATGGG TAACAGACAT A 

_ ____ AATAAACTCC CCGGGCAGCT CACTTAAGTT 

CTGTCGAGCC 
480 

GCTTAACGGG 



AAACGCTCAT CAGTGATATT 



C „G TGCAGC^ GAGCCACAGG CTGCTGTCCA ACTTGCGGTT 
540 

fTACG CCTACATGGG GGTAGAGTCA TAATCGTGCA TCAGGATAGG 
CGGCGAAGGA GAAGTCCACG CCTACATW- 

600 

„C TGCAGCAGCG CGCGAATAAA ^ CGCCGCTCCG TCCTGCAGGA 
660 

„ GCAGTGGTCT CCTCAGCGAT GA.CGCACC GCCCGCAGCA TAAGGCGCCT 
720 

prRGC QCACCCTGAT CTCACTTAAA TCAGCACAGT AACTGCAGCA 
TGTCCTCCGG GCACAGCAGC GCACCCio 

780 

„A „CA AAATCCCACA GTGCAAGGCG CTGTATCCAA AGCTCATGGC 
840 

„CA GAACCCACGT GGCCA.CA.A GCACAAGGGC AGGTAGATTA AGTGGCGACC 
900 
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CCTCATAAAC ACGCTGGACA TAAACATTAC CTCTTTTGGC ATGTTGTAAT TCACCACCTC 
960 

CCGGTACCAT ATAAACCTCT GATTAAACAT GGCGCCATCC ACCACCATCC TAAACCAGCT 
1020 

GGCCAAAACC TGCCCGCCGG CTATACACTG CAGGGAACCG GGACTGGAAC AATGACAGTG 
1080 

GAGAGCCCAG GACTCGTAAC CATGGATCAT CATGCTCGTC ATGATATCAA TGTTGGCACA 
1140 

ACACAGGCAC ACGTGCATAC ACTTCCTCAG GATTACAAGC TCCTCCCGCG TTAGAACCAT 
1200 

ATCCCAGGGA ACAACCCATT CCTGAATCAG CGTAAATCCC ACACTGCAGG GAAGACCTCG 
1260 

CACGTAACTC ACGTTGTGCA TTGTCAAAGT GTTACATTCG GGCAGCAGCG GATGATCCTC 
1320 

CAGTATGGTA GCGCGGGTTT CTGTCTCAAA AGGAGGTAGA CGATCCCTAC TGTACGGAGT 
1380 

GCGCCGAGAC AACCGAGATC GTGTTGGTCG TAGTGTCATG CCAAATGGAA CGCCGGACGT 
1440 

AGTCATATTT CCTGAAGCAA AACCAGGTGC GGGCGTGACA AACAGATCTG CGTCTCCGGT 
1500 

CTCGCCGCTT AGATCGCTCT GTGTAGTAGT TGTAGTATAT CCACTCTCTC AAAGCATCCA 
1560 

GGCGCCCCCT GGCTTCGGGT TCTATGTAAA CTCCTTCATG CGCCGCTGCC CTGATAACAT 
1620 

CCACCACCGC AGAATAAGCC ACACCCAGCC AACCTACACA TTCGTTCTGC GAGTCACACA 
1680 



PCT/EP97/05251 

WO 98/13499 

-134- 



CGGGAGGAGC — » TGTTTTTTTT TTTATTCCAA AAGATTATCC 
1740 

^CAA AATGAAGATC TATTAAGTGA ACGCGCTCCC GTGCGGTGGC G TC GTCAAAC 
1800 

XG.ACAGGCA AAGAAGAGAT AATGGCATTT GTAAGATGTT GGAGAA^GG TTCCAAAAGG 
1860 

arrrTAAACC CTTCAGGGTG AATCTCCTCT 
CAAACGGCCC TCACGTCCAA GTGGACGTAA AGGCTAAACC 

1920 

^CA^G GAGCAGGTTG AAGGA T GCG= AAATAATTGT GATCTCGGGA GGT.GTGAAT 
1980 

^rrrr AATATTAAGT CCGGCCATTG TAAAAATCTG CTCCAGAGCG 
ATATCTCTAA GCAAATCCCG AATATTAAvj i 

2040 

rCAGCGAATC ATGATTGCAA AAATTCAGGT TCCTCACAGA 
CCCTCCACCT TCAGCCTCAA GCAGCGAAK- 

2100 

GGTGTATAAG ATTGAAAAGC GGAACATTAA CAAAAATACC GCGATCCCGT AGGTCCCTTC 
2160 

GCAGGGCCAG CTGAACATAA TCGTGCAGGT GTGCACGGAC CAGCGGGGCG ACTTCCGCGC 
2220 

CCCACACTGA TTATGACACG CATACTCGGA GCTATGCTAA 
CAGGAACCTT GACAAAAGAA CCCACACl^A 

2280 

CCAGGGTAGC CCCGATGTAA GCTTTGTTGC ATGGGCGGCG ATATAAAATG CAAGGTGCTG 
2340 

CTCAAAAAAT CAGGCAAAGC CTCGGGGAAA AAAGAAAGCA CATCGTAGTC ATCCTCATGC 
2400 

AGATAAAGGC AGGTAAGCTC CGGAACCACC AGAGAAAAAG ACACCATTTT TCTCTCAAAC 
2460 
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ATGTCTGCGG GTTTCTGCAT AAACACAAAA TAAAATAACA AAAAAACATT TAAACATTAG 
2520 

AAGCCTGTCT TACAACAGGA AAAACAACCC TTATAAGCAT AAGACGGACT ACGGCCATGC 
2580 

CGGCGTGACC GTAAAAAAAC TGGTCACCGT GATTAAAAAG CACCACCGAC AGCTCCTCGG 
2640 

TCATGTCCGG AGTCATAATG TAAGACTCGG TAAACACATC AGGTTGATTC ATCGGTCAGT 
2700 

GCTAAAAAGC GACCGAAATA GCCCGGGGGA ATACATACCC GCAGGCGTAG AGACAACATT 
2760 

ACAGCCCCCA TAGGAGGTAT AACAAAATTA ATAGGAGAGA AAAACACATA AACACCTGAA 
2820 

AAACCCTCCT GCCTAGGCAA AATAGCACCC TCCCGCTCCA GAACAACATA CAGCGCTTCA 
2880 

CAGCGGCAGC CTAACAGTCA GCCTTACCAG TAAAAAAGAA AACCTATTAA AAAAACACCA 
2940 

CTCGACACGG CACCAGCTCA ATCAGTCACA GTGTAAAAAA GGGCCAAGTG CAGAGCGAGT 
3 000 

ATATATAGGA CTAAAAAATG ACGTAACGGT TAAAGTCCAC AAAAAACACC CAGAAAACCG 
3060 

CACGCGAACC TACGCCCAGA AACGAAAGCC AAAAAACCCA CAACTTCCTC AAATCGTCAC 
3120 

TTCCGTTTTC CCACGTTACG TAACTTCCCG GATCCTCTCC CGATCCCCTA TGGTCGACTC 
3180 

TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGTA TCTGCTCCCT GCTTGTGTGT 
3240 
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,r rrGAGCAAAA TTTAAGCTAC AACAAGGCAA GGCTTGACCG 
TGGAGGTCGC TGAGTAGTGC GCGAGCAAAA 

3300 

^GCAT GAAGAA.CTG CTTAGGGTTA ««m»C GCTGCTTCGC GATGTACGGG 
3360 

CC „C GCGTTGACAT TGATTATTGA CTAGTTATTA ATAGTAATCA ATTACGGGGT 
3420 

CATTAGTTCA TAGCCCATAT ATGGAGTTCC GCGTTACATA ACTTACGGTA AA.GGGCCGC 
3480 

CT GGC T GAGC GCCCAACGAC GCCCGCCCA. TGACGTCAAT ^ AATGACGTAT G^CCCA.AG 
3540 

TAACGCCAAT AGGGACTTTC CATTGACGTC AATGGGTGGA CTATTTACGG TAAACTGCCC 
3600 

rtTrAAGTG TATCATATGC CAAGTACGCC CCCTATTGAC GTCAATGACG 
ACTTGGCAGT ACATCAAGTG TATLAiAi 

3660 

GTAAATGGCC CGCCTGGCAT TATGCCCAGT ACATGACCTT ATGGGACTTT CCTACTTGGC 
3720 

AGTACATCTA CGTATTAGTC ATCGCTATTA CCATGGTGAT GCGGTTTTGG CAGTACATCA 
3780 

ATGGGCGTGG ATAGCGGTTT GACTCACGGG GATTTCCAAG TCCCAGCGC ATTGACGTCA 
3840 

rrr ACTTTCC AAAATGTCGT AACAACTCCG 
ATGGGACTTT GTTTTGGCAC CAAAATCAAC GGGACTTTCC AAAA 

3900 

CCCCATTGAC GCAAATGGGC GGTAGGCGTG TACGGTGGGA GGTCTATATA AGCAGAGCTG 
3960 

ir*r**CCC ACTGCTTACT GGCTTATCGA AATTAATACG ACTCACTATA 
TCTGGCTAAC TAGAGAACCC ACI^-ii* 

4020 
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GGGAGACCCA AGCTTGGTAC CGAGCTCGGA TCTGAATTCG AGCTCGCTGT TGGGCTCGCG 
4080 

GTTGAGGACA AACTCTTCGC GGTCTTTCCA GTACTCTTGG ATCGGAAACC CGTCGGCCTC 
4140 

CGAACGGTAC TCCGCCACCG AGGGACCTGA GCGAGTCCGC ATCGACCGGA TCGGAAAACC 
4200 

TCTCGAGAAA GGCGTCTAAC CAGTCACAGT CGCAAGGTAG GCTGAGCACC GTGGCGGGCG 
4260 

GCAGCGGGTG GCGGTCGGGG TTGTTTCTGG CGGAGGTGCT GCTGATGATG TAATTAAAGT 
4320 

AGGCGGTCTT- GAGACGGCGG ATGGTCGAGG TGAGGTGTGG CAGGCTTGAG ATCCAAGATG 
4380 

AAGCGCGCAA GACCGTCTGA AGATACCTTC AACCCCGTGT ATCCATATGA CACGGAAACC 
4440 

GGTCCTCCAA CTGTGCCTTT TCTTACTCCT CCCTTTGTAT CCCCCAATGG GTTTCAAGAG 
4500 

AGTCCCCCTG GGGTACTCTC TTTGCGCCTA TCCGAACCTC TAGTTACCTC CAATGGCATG 
4560 

CTTGCGCTCA AAATGGGCAA CGGCCTCTCT CTGGACGAGG CCGGCAACCT TACCTCCCAA 
4620 

AATGTAACCA CTGTGAGCCC ACCTCTCAAA AAAACCAAGT C AAAC AT AAA CCTGGAAATA 
4680 

TCTGCACCCC TCACAGTTAC CTCAGAAGCC CTAACTGTGG CTGCCGCCGC ACCTCTAATG 
4740 



GTCGCGGGCA ACACACTCAC CATGCAATCA CAGGCCCCGC TAACCGTGCA CGACTCCAAA 
4800 
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*rr ACCCCTCACA GTGTCAGAAG GAAAGCTAGC CCTGCAAACA 
CTTAGCATTG CCACCCAAGG ACCCCTCACA fa 

4860 

ACCCTTACTA TCACTGCCTC ACCCCCTCTA 



TCAGGCCCCC TCACCACCAC CGATAGCAGT 



4920 



«C T T GGGCATTGAC TTGAAAGAGC CCATTTATAC AGAAAATGGA 



ACTACTGCCA 
4980 

^AGGAG TAAAGXAGGG GGCTCCTTTG GA~= — - «™« 
5040 

GXAGCAAGTG GTGCAGGTGT GACTATTAAT AATACTTCCT TGCAAACTAA AGTTACTGGA 
5100 

GGC.TGGG.T TTGATTCACA AGGCAATATG CAACTTAATG TAGCAGGAGG ACTAAGGATT 
5160 

GATTCTCAAA ACAGACGCCT TATACTTGAT GTTAGTTATC CGTTTGATGC TCAAAACCAA 
5220 

CTAAATCTAA GACTAGGACA GGGCCCTCTT TTTATAAACT GAGCGCACAA CTTGGATATT 

Lacaaga aaggccttta cttgtttaca gcttcaaaca attccaaaaa ggttgagg^ 

5340 

^ctaagga cgccaaggg gttgatgttt gacgctacag gcataogcat taatgcagga 

5400 

gatgggcttg aa™c acctaatgca cgaaagaca* atccgctcaa aacaaaaat, 

5460 

GGCCATGGCC TAGAATTTGA TTCAAACAAG GCTATGGTTC CTAAACTAGG AACTGGCCTT 
5520 

AGTTTTGACA GCACAGGTGC CATTACAGTA GGAAACAAAA ATAATGATAA GCTAACTTTG 
5580 
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TGGACCACAC CAGCTCCATC TCCTAACTGT AGACTAAATG CAGAGAAAGA TGCTAAACTC 
5640 

ACTTTGGTCT TAACAAAATG TGGCAGTCAA ATACTTGCTA CAGTTTCAGT TTTGGCTGTT 
5700 

AAAGGCAGTT TGGCTCCAAT ATCTGGAACA GTTCAAAGTG CTCATCTTAT TATAAGATTT 
5760 

GACGAAAATG GAGTGCTACT AAACAATTCC TTCCTGGACC CAGAATATTG GAACTTTAGA 
5820 

AATGGAGATC TTACTGAAGG CACAGCCTAT ACAAACGCTG TTGGATTTAT GCCTAACCTA 
5880 

TCAGCTTATC CAAAATCTCA CGGTAAAACT GCCAAAAGTA ACATTGTCAG TCAAGTTTAC 
5940 

TTAAACGGAG ACAAAACTAA ACCTGTAACA CTAACCATTA CACTAAACGG TACACAGGAA 
6000 

ACAGGAGACA CAACTCCAAG TGCATACTCT ATGTCATTTT CATGGGACTG GTCTGGCCAC 
6060 

AACTACATTA ATGAAATATT TGCCACATCC TCTTACACTT TTTCATACAT TGCCCAAGAA 
6120 

' TAAAGAAGCG GCCGCTCGAG CATGCATCTA GAGGGCCCTA TTCTATAGTG TCACCTAAAT 
6180 

GCTAGAGCTC GCTGATCAGC CTCGACTGTG CCTTCTAGTT GCCAGCCATC TGTTGTTTGC 
6240 

CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT TTC CTAATAA 
6300 



AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT CTATTCTGGG GGGTGGGGTG 
6360 
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OOCCAOSACA «»«»« «=»™*™ GOCATSCTGG GGATGCGGTG 

6420 

GGCTCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT CTAGGGGGTA TCCCCACGCG 
6480 

CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT GACCGCTACA 
6540 

CTTOCCAOCO CCCTASCSCC CSCTCCTTTC CC^CTTCC CTTCOTTCT CSCCACSTTC 
6600 

„ ^.^rrrrr PGCATCCCTT TAGGGTTCCG ATTTAGTGCT 
GCCGGCTTTC CCCGTCAAGC TCTAAATCGG GGCATUUUii 

6660 

TTACGGCACC TCGACCCCAA AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG 
6720 

CCCTGATAGA CGGTTTTTCG CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC 
6780 

TTGTTCCAAA CTGGAACAAC ACTCAACCCT ATCTCGGTCT ATTCTTTTGA TTTATAAGGG 
6840 

ATTTTGGGGA TTTCGGCCTA TTGGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG 



6900 



AATTAATTCT 
6960 



GTGGAATGTG TGTCAGTTAG GGTGTGGAAA GTCCCCAGGC TCCCCAGGCA 

GGCAGAAGTA TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCAGGTGTGG AAAGTCCCCA 
7020 

CAGGCAGAAG TATGCAAAGC ATGCATCTCA ATTAGTCAGC AACCATAGTC 



GGCTCCCCAG 
7080 

CCGCCCCTAA 
7140 



CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA GTTCCGCCCA TTCTCCGCCC 
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CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG CCGCCTCTGC CTCTGAGCTA 
7200 

TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT TTTGCAAAAA GCTCCCGGGA 
7260 

GCTTGTATAT CCATTTTCGG ATCTGATCAA GAGACAGGAT GAGGATCGTT TCGCATGATT 
7320 

GAACAAGATG GATTGCACGC AGGTTCTCCG GCCGCTTGGG TGGAGAGGCT ATTCGGCTAT 
7380 

GACTGGGCAC AACAGACAAT CGGCTGCTCT GATGCCGCCG TGTTCCGGCT GTCAGCGCAG 
7440 

GGGCGCCCGG TTCTTTTTGT CAAGACCGAC CTGTCCGGTG CCCTGAATGA ACTGCAGGAC 
7500 

GAGGCAGCGC GGCTATCGTG GCTGGCCACG ACGGGCGTTC CTTGCGCAGC TGTGCTCGAC 
7560 

GTTGTCACTG AAGCGGGAAG GGACTGGCTG CTATTGGGCG AAGTGCCGGG GCAGGATCTC 
7620 

CTGTCATCTC ACCTTGCTCC TGCCGAGAAA GTATCCATCA TGGCTGATGC AATGCGGCGG 
7680 . 

CTGCATACGC TTGATCCGGC TACCTGCCCA TTCGACCACC AAGCGAAACA TCGCATCGAG 
7740 

CGAGCACGTA CTCGGATGGA AGCCGGTCTT GTCGATCAGG ATGATCTGGA CGAAGAGCAT 
7800 

CAGGGGCTCG CGCCAGCCGA ACTGTTCGCC AGGCTCAAGG CGCGCATGCC CGACGGCGAG 
7860 

GATCTCGTCG TGACCCATGG CGATGCCTGC TTGCCGAATA TCATGGTGGA AAATGGCCGC 
7920 
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TTTTCTGGAT TCATGGAGTG *=GCCGGGTG GGTGTGGCGG ACCGGTATCA GGACATAGCG 
7980 

TTGGCTACCC TGAAGAGCTT GGCGGCGAAT GGGCTGACCG CTTCCTCGTG 

8040 

CTTTACGGTA TCGCCGCC CGATTCGCAG CGCATCGCCT TCTATCGCCT TCTTGACGAG 
8100 

TTCITCTGAG CGGGACTCTG GGGTTCGAAA TGACCGACCA AGCGACGCCG AACCTGCCAT 
8160 

CACGAGATTT CGA.TCCACC GCCGCC^ ATGAAAGGTT GGGCTTCGGA ATGG TOT CC 
8220 

GGGACGCCGG CTGGATGATC CCCAGCGCG GGGATCTCAT GCTGGAGTTC ^GCGCACC 
8280 

CCAACTTGTT TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA 
8340 

CAAATAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT 
8400 

CTTATCATGT CTGTATACCG TCGACCTCTA GCTAGAGCT, GGCGTAATCA TGGTCATAGC 
8460 

TGTTTCCTGT GTGAAATTGT TATCCGCTCA CAATTCCACA CAACATACGA GGCGGAAGCA 
8520 

TAAAGTGTAA AGCCTGGGGT GCCTAATGAG TGAGCTAACT CACATTAATT GCGTTGCGCT 
8580 

CACTGCCCGC TTTCCAGTCG GGAAACCTGT CGTGCCAGCT GCATTAATGA ATCGGCCAAC 
8640 

GCGCGGGGAG AGGCGGTTTG CGTATTGGGC GC T C TT CCGC TTCCTCGCC AC.GACTCGC 
8700 
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TGCGCTCGGT CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT 
8760 

TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG 
8820 

CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 
8880 

AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT 
8940 

ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC" TGTTCCGACC CTGCCGCTTA 
9000 

CCGGATACCT GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCACGCT 
9060 

GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC 
9120 

CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA 
9180 

GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG 
9240 

TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG 
9300 

TATTTGGTAT CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 
9360 

GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA 
9420 

CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 
9480 



WO 98/13499 



- 144- 



PCT/EP97/05251 



AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCA.GAG ATTATCAAAA *»««T» 



9540 



cctagatcct tttaaattaa aaatgaagt, ttaaatcaat ctaaagtata tatgagtaaa 

9600 

CTI gg T ctga cagttacgaa tgcttaatca gtgaggcacg tatctcagcg atctgtctat 

9660 

TTCGTTCATC CAtAGTTGCG TGACTCGCCG TGGTGTAGAT AAGTACGATA CGGGAGGGCT 
9720 

TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 
9780 

TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT 
9840 

CGGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA 
9900 

ATAGTTTGCG CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 
9960 

GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACA^A TCCCGCA^T 
10020 

TGTGCAAAAA AGCGGTTAGC TGCTTCGGTC CTCCGATCGT TOTCAGAAGT AAGTTGGCCG 
10080 ' 

CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG 
10140 

TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC 
10200 

CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA 



GGCGACCGAG TTGCTCTTGC 
10260 
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CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC 
10320 

CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 
10380 

TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG 
10440 

GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA 
10500 

GCATTTATCA GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 
10560 

AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC 
10610 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGTACACCGG ATCCGGCGCA CACC24 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 18: 

CACAACGAGC TCAATTAATT AATTGC C AC A TCCTC 

35 
G 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
Thr Leu Trp Thr 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Pro Ser Ala Ser Ala Ser Ala Ser Ala Pro Gly Ser 
1 5 10 
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We claim: 

, A packaging cell line expressing one or more adenovirus structural proteins, 
' polypeptides, orfragments .hereof, wherern said strucrura, protein is selected from the 

group consisting of: 

a. penton base; 

b. hexon; 

c. fiber; 

d. polypeptide Ma; 

e. polypeptide V; 

f . polypeptide VI; 

g. polypeptide VII; 

h. polypeptide VIII; and 

i biologically active fragments thereof. 

A packaging cell line according to claim 1. which supports the production of a viral 
vector. 

3 A packaging cell line according to claim 2, wherein the viral vector compnses a 
tripartite leader sequence or a sequence substantially homologous thereto. 

4 A packaging cell line according to any one of the preceding claims wherein sa.d 
structural protein is fiber and wherein said fiber protein has been modified to inc.ude a 
non-native amino acid residue sequence which targets a specific receptor, but wh.ch 
does not disrupt trimer formation or transport of fiber into the nucleus. 

5 A packaging cell line according to claim 4 wherein said non-native amino acid res.due 
sequence alters the binding specificity of the fiber for a targeted cell type. 
A packaging cell line according to. any one of the preceding claims wherein sa>d 
structural protein is fiber comprising amino acid residue sequences from more than one 



2. 



6. 



adenovirus serotype. 



A packaging cell line according to any one of claims 2 to 6, wherein said viral vector 
includes a nucleic acid sequence having a deletion or mutation of a DNA sequence 
encoding an adenovirus structural protein, polypeptide, or fragment thereof. 
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8. A packaging cell line according to claim 7 wherein said viral vector includes a nucleic 
acid sequence having a deletion or mutation of the DNA sequences encoding 
regulatory polypeptides El A and E1B. 

9. A packaging cell line according to claim 7, wherein said viral vector further includes a 
nucleic acid sequence having a deletion or mutation of a DNA sequence encoding one 
or more of the following regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, 
or fragments thereof. 

10. A packaging cell line according to claim 7, wherein a foreign DNA sequence encoding 
one or more foreign proteins, polypeptides or fragments thereof has been inserted in 
place of any of said deletions in said therapeutic viral vector. 

11. A packaging cell line according to claim 10, wherein said foreign DNA encodes a 
tumor-suppressor protein or a biologically active fragment thereof. 

12. A packaging cell line according to claim 10, wherein said foreign DNA encodes a 
suicide protein or a biologically active fragment thereof. 

13. A packaging cell line according to any one of the preceding claims, wherein said cell 
line is an epithelial cell line. 

14. A packaging cell line according to claim 12, wherein said cell line is selected from the 
group consisting of 293, A549, W162, HeLa, Vero, 21 1, and 211 A cell lines. 

15. A viral vector comprising deletion or mutation of a DNA sequence encoding an 
adenovirus structural protein, polypeptide, or fragment thereof, wherein said structural 
protein, polypeptide or fragment thereof is selected from the group consisting of: 

a. penton base; 

b. hexon; 

c. fiber; 

d. polypeptide Ilia; 

e. polypeptide V; 

f. polypeptide VI; 

g. polypeptide VII; 

h. polypeptide VIE; and 

i. biologically active fragments thereof. 
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16. 
17. 

18. 

19. 



20. 

21. 
22. 

23. 
24. 



A viral « t « according ro claim .5, fimher comprising dderion or muuuion of the 
DN A sequences encoding P*W— ElAandElB, or fragmems .hereof. 

A viral vecor acoordrng to e.aim 15. further comprising delerion or mutarion of the 
DNA sequence encoding one or more of Ire following regolarory prore.ns or 
polypeptides: E2A. E2B. E3. E4, U. or fragmenrs dtereof. 
A « vector according to claim .5 fnrrner comprising a foreign DNA seoue.ee 
inse „ed in place of the DNA sequence encoding said srrucrural prorein. polypept.de. or 

fragment thereof . . 
A vW vector according ronny one of Cairns 16. ,7 or ,8. further eomprrsmg forc.gn 
DNA sequences inserted in p.aoc of A. DNA sequences encoding said regularor, 
polypeptides or fragments thereof. 

A vim. vecror Uniting all or pan of a DNA sequence encoding adenovirus fiber protein, 
wherein said DNA sequence has been replaced by a forego DNA sequence encodmg . 
therapeutic molecule. 

A v,,.l vector having a muration in a DNA sequence encoding adenovirus fiber prorem. 
A viral vecror according <o any one of 15 ,0 21 wherein the vector is a rherapeuuc 
vector. 

A complement plasmid composing a promoter nucleor.de sequence operanvCy 
, inked to a nucleotide sequence encoding an adenovirus structural prorein. polypept.de. 
or fragment thereof. 

A complementing plasmid according to claim 23, further comprising a nucleoude 
sequence encoding: 

a a first adenovirus regulatory protein, polypeptide, or fragment thereof; or 

b . a second regulatory protein, polypeptide, or fragment thereof; or 

c. a third regulatory protein, polypeptide, or fragment thereof; or 
d any combination of the foregoing. 

A complementing plasmid according to claim 23. wherein said adenovirus structural 
protein or polypoplide is selected from the group consisting of: 
a. penton base; 
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b. hexon; 

c. fiber; 

d. polypeptide Ula; 

e. polypeptide V; 

f. polypeptide VI; 

g. polypeptide VII; 

h. polypeptide VIE; and 

i. biologically active fragments thereof. 

26. A complementing plasmid according to claim 24, wherein said regulatory proteins, 
polypeptides or fragments thereof are selected from the group consisting of E1A, E1B, 
E2A,E2B,E3,E4,andL4. 

27. A complementing plasmid comprising a promoter nucleotide sequence operatively 
linked to a nucleotide sequence encoding an adenovirus regulatory protein, 
polypeptide, or fragment thereof. 

28. A composition useful in the preparation of a therapeutic viral vectors, the composition 
comprising a cell containing a delivery plasmid comprising an adenovirus genome 
lacking a nucleotide sequence encoding fiber. 

29. A composition according to claim 28, wherein said delivery plasmid further comprises ; 
nucleotide sequence encoding a foreign polypeptide. 

30. A composition according to claim 29, wherein said polypeptide is a therapeutic 
molecule. 

31. A composition according to claim 28, wherein said delivery plasmid is selected from 
the group consisting of pDVl, p E1B gal, p ElsplB, and pFG140-f. 

32. A composition according to claim 28, wherein said cell further comprises a 
complementing plasmid containing a nucleotide sequence encoding fiber, said plasmid 
being stably integrated into the cellular genome of said cell. 

33. A composition according to claim 32, wherein said complementing plasmid has the 
characteristics of pCLF having ATCC Accession Number 97737. 

34. A composition useful in the preparation of therapeutic viral vectors, said composition 
comprising a cell containing: 
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„ . „,„„ of ne „ viral particles in the presence of said firs, delivery plasmrd. 

■ avaM wirhin said cell to produce a drerapeuric viral vecror. 

36 A composition according to claim 34, wherein said ce.l further compnses a 
Iplenrins plasnrid contains a nuclide sequence encodin g f.her. sard p,asmrd 
being stably inlegrated into .he cellular genome of sard cell. 

37 A J!JL according ,0 Cairn 34. wherein said ftrs, or second deltvery plasmrd 
rurdrercomprisea a nucleoride sequence encoding a foreign polypepnde. 

38 . A composirion according to claim 37, wherein said polypepude is a drerapeuuc 

molecule. . 

39 . A composition according .0 claim 34, wherein said ftrs. deliver plasmrd lacks 

adenovirus packaging signal sequences. 

40. A composirron according .0 Cairn 34, wherehr said second deliver plasmrd comams 

LacZ reporter construct. 
41 A composirion according ,o Cairn 34. wherein said second deliver plasmrd funher 

lacks a nucleoride sequence encoding an adenovims regulatoty prorem. 
A2 A composirion according ro Carm 41. wherein said regularory prorein ts El. 

43 AcomposirionaccordingroclaimSS.wheminsaidcomplemen.ingp.asmrdhasme 

characterises of pOF having ATCC Accession Number 97737. 

44 A composition according to Carm 34, wherein said firs, delivery plasmid lacks . 
nucleotide sequence encoding adenovims E4 prorern and said second dCvery p.nsnud 
iacks a nucleoride sequence encoding adenovims El prolan. 

45 A composirion according to claim 44, wherein said cell contains at lease one 
complements plasmid encoding an adenoviral regularory prorein and a smrcrum! 

4 , Virion accordrng ,0 claim 45. wherctn said regular prorein is E4 and said 
structural protein is fiber. 
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47. A composition according to claim 45, wherein said regulatory protein is El and said 
structural protein is fiber. 

48. A composition according to claim 45, wherein said regulatory protein is both El and 
E4 and said structural protein is fiber. 

49. A composition according to claim 45, wherein said adenoviral regulatory protein and 
said structural protein are encoded by separate complementing plasmids. 

50. A composition according to claim 45, wherein said cell is selected from the group 
consisting of 293, A549, W162, HeLa, Vero, 21 1, and 21 1 A. 
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